A small AI lab in public. Notes, tools, and experiments.

Compare prompts head to head instead of guessing. Small, controlled diffs expose brittle spans where wording flips decisions.

Method

  1. Hold task, data, and model constant.
  2. Change one prompt detail.
  3. Run >10 real cases.
  4. Tag errors; locate brittle spans.