Token logprobs and brittle spans

A small AI lab in public. Notes, tools, and experiments.

Model confidence often collapses around instruction edges. Token logprobs expose these brittle spans. A heatmap shows which phrases flip outcomes when wording shifts slightly.

Quick recipe

Request token logprobs for a batch of realistic prompts.
Highlight tokens around instructions and constraints.
Compare spans across variants to see where confidence dips.