COMPARISON

RunLedger vs VCR.py-style recording

VCR.py is ideal for HTTP clients. RunLedger handles any tool protocol plus contracts and budgets.

comparison vcr record-replay Updated 2026-01-23

Direct Answer

Recommendation Use RunLedger when your agent uses multiple tools beyond HTTP and you need CI gates.

VCR.py captures HTTP traffic. RunLedger captures arbitrary tool calls and applies schema + budget gates on replay.

Use RunLedger when	Use VCR.py when
Your agent calls multiple tools	You only need HTTP recording
You need contract + budget gates	You only need request/response playback

bash

runledger run ./evals/demo --mode record
runledger run ./evals/demo --mode replay --baseline baselines/demo.json

If your workflow is a standard HTTP client with no tool protocol, VCR.py is sufficient.

When to use RunLedger instead of snapshot tests for agent CI.

How RunLedger compares to hand-written mocks for tool-using agents.

When to use RunLedger instead of golden file tests for agent behavior.

Last updated: 2026-01-23