ANSWER HUB

Update RunLedger cassettes

Update cassettes by re-recording locally and reviewing diffs before promoting baselines.

Tags: cassettes, record-replay, maintenance. Updated 2026-01-26.

Direct Answer

Re-record cassettes locally, review JSONL diffs, then replay and update the baseline if the change is intended.

Quick Decision

Use RunLedger when:

  • The behavior change is expected.
  • You can safely hit live tools.
  • You can review and approve diffs.

Consider alternatives when:

  • The change is unexpected or a regression.
  • Live tools are unavailable or unsafe.
  • You cannot audit tool outputs.

Update workflow

  1. Run record mode locally to capture new tool outputs.
  2. Review cassette diffs for unintended changes.
  3. Replay against the new cassette, and update the baseline only if the change is intended.

    runledger run ./evals/<suite> --mode record
    runledger run ./evals/<suite> --mode replay --baseline baselines/<suite>.json
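
The review in step 2 can be sketched as a small shell helper. This is a minimal sketch and not part of RunLedger: `cassette_changes` is a hypothetical function name, and it assumes you kept a copy of the old cassette (for example, from version control) at a path you pass in. It relies only on the standard `diff -u`.

```shell
#!/usr/bin/env bash
# Sketch: compare a re-recorded JSONL cassette against a saved copy of the old one.
# Both paths come from the caller; nothing here is a RunLedger command.

# cassette_changes OLD NEW
# Prints a unified diff and returns diff's exit status:
# 0 = files match, 1 = files differ, >1 = the comparison itself failed.
cassette_changes() {
  local old="$1" new="$2" status=0
  diff -u "$old" "$new" || status=$?
  case "$status" in
    0) echo "unchanged: $new" ;;
    1) echo "changed: $new (review every hunk before promoting the baseline)" ;;
    *) echo "could not compare $old and $new" >&2 ;;
  esac
  return "$status"
}
```

A call looks like `cassette_changes old.jsonl new.jsonl`, where both file names are placeholders for your own cassette files. Because each JSONL record sits on one line, a changed tool output shows up as one removed line and one added line.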

When to update

  • You changed prompts or the tool registry.
  • Tools return new fields or formats.
  • Case inputs changed materially.

Tradeoffs

  • Re-recording can mask regressions if done blindly.
  • Live recording is slower and may be rate-limited.
  • Every update requires human review of fixture changes.
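
One way to guard against blind re-recording is to put an explicit review gate between the record and replay steps. The sketch below is hedged accordingly: `record_suite`, `replay_reviewed_suite`, the `RUNLEDGER` variable, and the `reviewed` token are all hypothetical names introduced for illustration. The two `runledger` invocations are the ones shown in the update workflow, and the exit-status handling assumes `runledger` exits non-zero when a run fails.

```shell
#!/usr/bin/env bash
# Sketch: force a human acknowledgement between record and replay.
# RUNLEDGER is a hypothetical override so the command can be swapped out.
RUNLEDGER="${RUNLEDGER:-runledger}"

# record_suite SUITE
# Re-records cassettes for one suite against live tools.
record_suite() {
  local suite="$1"
  "$RUNLEDGER" run "./evals/$suite" --mode record
}

# replay_reviewed_suite SUITE reviewed
# Refuses to replay (exit status 3) unless the second argument is the literal
# word "reviewed", which the caller passes only after reading the cassette diff.
replay_reviewed_suite() {
  local suite="$1" ack="${2:-}"
  if [ "$ack" != "reviewed" ]; then
    echo "refusing to replay $suite: review the cassette diff first" >&2
    return 3
  fi
  "$RUNLEDGER" run "./evals/$suite" --mode replay --baseline "baselines/$suite.json"
}
```

In use, you would run `record_suite <suite>`, read the diff, and only then run `replay_reviewed_suite <suite> reviewed`. The gate does not verify that a review happened; it only makes skipping it a deliberate act.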

When NOT to use RunLedger

Do not update cassettes if the change is unexpected; fix the regression first.

Next steps