Quick recipe
- Request token logprobs for a batch of realistic prompts.
- Highlight tokens around instructions and constraints.
- Compare spans across variants to see where confidence dips.
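As a rough sketch of the last two steps, the snippet below groups consecutive low-probability tokens into spans. It assumes the logprobs have already been fetched and flattened into `(token, logprob)` pairs. That input shape, the `low_confidence_spans` name, and the `min_prob` cutoff of 0.5 are illustrative assumptions, not any particular API's format.

```python
import math
from typing import List, Tuple

# Hypothetical input shape: (token_text, logprob) pairs.
Token = Tuple[str, float]


def low_confidence_spans(
    tokens: List[Token], min_prob: float = 0.5
) -> List[Tuple[int, int, str]]:
    """Group consecutive tokens whose probability is below min_prob.

    Returns (start_index, end_index_exclusive, joined_text) per span.
    """
    spans = []
    start = None
    for i, (_, logprob) in enumerate(tokens):
        is_low = math.exp(logprob) < min_prob
        if is_low and start is None:
            start = i  # a new low-confidence span begins here
        elif not is_low and start is not None:
            text = "".join(t for t, _ in tokens[start:i])
            spans.append((start, i, text))
            start = None
    if start is not None:  # span runs to the end of the sequence
        text = "".join(t for t, _ in tokens[start:])
        spans.append((start, len(tokens), text))
    return spans


# Made-up values for illustration only, not real model output.
example = [
    ("Reply", -0.05),
    (" in", -0.10),
    (" JSON", -1.60),
    (" only", -2.30),
    (".", -0.02),
]
print(low_confidence_spans(example))
```

Running the same function over each prompt variant and comparing the returned spans is one simple way to see where confidence dips move as the wording changes.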
Model confidence often collapses at instruction edges, the tokens where an instruction or constraint begins or ends. Token logprobs expose these brittle spans, and a per-token heatmap of them shows which phrases flip outcomes when the wording shifts slightly.
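One minimal way to draw such a heatmap is to emit HTML spans whose background gets more intense as token probability drops. This is a sketch under assumptions: the `(token, logprob)` input shape, the `heatmap_html` name, and the red color scale are all illustrative choices.

```python
import html
import math
from typing import List, Tuple


def heatmap_html(tokens: List[Tuple[str, float]]) -> str:
    """Render tokens as HTML spans; lower probability gives a stronger red."""
    parts = []
    for text, logprob in tokens:
        p = min(1.0, math.exp(logprob))  # convert logprob to probability
        alpha = round(1.0 - p, 2)        # 0.0 = confident, 1.0 = very unsure
        parts.append(
            f'<span style="background: rgba(255,0,0,{alpha})" '
            f'title="p={p:.2f}">{html.escape(text)}</span>'
        )
    return "".join(parts)
```

Writing the returned string to an `.html` file and opening it in a browser gives a quick visual check. Hovering over a token shows its probability via the `title` attribute.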