ANSWER HUB

RunLedger token budgets

Token budgets let you gate on runaway output sizes.

budgets tokens ci Updated 2026-01-26

Direct Answer

Token budgets are optional gates that fail a run when reported output tokens exceed max_tokens_out.

Quick Decision

Use RunLedger when Consider alternatives when
You can report token counts. You cannot measure token usage.
You need deterministic caps on output size. You only need soft monitoring.
Token usage is stable enough to gate. Token usage is extremely variable.

Budget snippet

yaml
budgets:
          max_tokens_out: 1200
          max_wall_ms: 20000

Requirements

  • Agents must emit token metrics per task.
  • Budgets apply at suite or case scope.
  • Use with baselines to catch gradual token creep.

Tradeoffs

  • Depends on accurate token accounting.
  • Can fail on legitimate output expansions.
  • Requires periodic tuning as prompts evolve.

When NOT to use RunLedger

Skip token budgets when token metrics are unavailable or when output size is intentionally dynamic.

Next steps