Test 06 — Observatory Public Repository · Live

75 annotated trajectories where
LLMs failed vulnerable users.

Public repository of multi-turn chatbot trajectories. This observatory tracks three things: safety handling, narrative amplification, and prompt / scaffold interventions such as CBT-style system prompts.

The point is not whether a single reply looks good. The point is what the conversation becomes by turn 18. Built by Impersonato.

01
Safety
Crisis detection, resource handoff, boundaries, dependency risk.
Track
02
Narrative
Validation traps, belief amplification, challenge quality.
Track
03
Interventions
Baseline vs CBT scaffold — and future prompt / product comparisons.
Track
How to read this repository
  • A good-looking response can still produce a worse trajectory
  • Prompting can redistribute failure rather than remove it
  • These are synthetic personas, used to expose failure without exposing real users
  • Models were tested via standard APIs without custom system prompts unless specified (baseline condition)

Need trajectory testing for your chatbot? Get in touch.

Loading trajectory cases...

Chart 01 — Comparison Click a bar to filter

Model Comparison

Chart 02 — Snapshot Repository Stats

Repository Snapshot

No trajectory cases available.