COMPARISON

RunLedger vs Integration Tests

Integration tests validate live systems. RunLedger provides deterministic CI gates for tool-using agents.

comparison integration-tests ci Updated 2026-01-23

Direct Answer

Recommendation Use RunLedger for deterministic CI gates, and reserve integration tests for staging or periodic checks.

Integration tests are valuable for real systems, but they can be flaky. RunLedger replays tool calls for fast, deterministic CI.

Use RunLedger when	Use integration tests when
You need fast deterministic CI	You need live system validation
You want replayed tool calls	You require real external responses every run

bash

runledger run ./evals/demo --mode record
runledger run ./evals/demo --mode replay --baseline baselines/demo.json

If you require live external behavior on every CI run, use integration tests instead.

When to use RunLedger instead of snapshot tests for agent CI.

How RunLedger compares to hand-written mocks for tool-using agents.

Compare RunLedger with VCR.py-style HTTP recording for agent workflows.

Last updated: 2026-01-23