VII · Verification & ReflectionExperimental·

Commitment Tracking

also known as Stated-Intent Ledger, Follow-Through Audit

Extract stated intents from each agent turn into a structured ledger with open / followed-through / expired status, making the gap between promise and follow-through visible and auditable.

Context

A conversational agent routinely makes small in-turn promises — "let me pull the latest figures", "I'll come back to this once the build finishes", "I'll keep an eye on that". These commitments are not user-imposed tasks; they are voluntary intentions the agent announces. The agent then continues the conversation, and the moment passes. Without an external surface tracking these intents, the agent has no signal that it just promised something and no way to notice when the promise is overdue.

Problem

Agents that produce text fluently produce stated-intents fluently too — and producing the intent is satisfying enough that the agent's own attention moves on without acting on it. The resulting confabulation gap ("the agent said it would do X; the agent never did X") is invisible from inside the conversation, because the same model that announced the intent is also the one summarising what it did, and that summary tends to round in the agent's favour. The user, who can spot the gap if they re-read, has no easy way to enforce follow-through either.

Forces

Stated intents are cheap to emit and expensive to track manually.
The agent that announced the intent cannot be trusted to audit itself in the same turn.
Most intents are short-lived; a few are load-bearing. Both look the same at extraction time.
Expiration must be automatic or the ledger grows unbounded.
Marking follow-through must be cheap, or the discipline collapses.

Example

A chat agent answers a question, then ends with "I'll pull the updated benchmark numbers in the next hour." The extraction pass logs a Commitment {statement: 'pull updated benchmark numbers', deadline: +1h, status: open}. An hour later the sweep finds the commitment still open; the agent's tick reads it, runs the action, and marks it followed-through with a link to the result. A separate offhand intent — "I should think about that more" — sits in the ledger as open and auto-expires after the sweep window without action, which is correct: it was rhetorical.

Diagram

sequenceDiagram participant U as User participant A as Agent (main) participant E as Extractor (cheap tier) participant L as Commitment Ledger U->>A: question A->>U: answer, plus 'I will fetch X in 1h' A->>E: pass turn for extraction E->>L: write Commitment(open, deadline=+1h) Note over A,L: ... 1 hour later ... A->>L: list open A->>A: actually fetch X A->>L: mark_followed_through(id, evidence)

Solution

Therefore:

After each turn the agent produces, run a separate, cheap-tier extraction pass (a small model or a structured prompt) that scans the turn for stated-intents and writes each as a Commitment record into an append-only ledger. Each record carries: a short statement of the intent, the turn it was raised in, an optional deadline or condition, and a status field (open). Expose two moves: mark_followed_through(id, evidence) flips the status when the agent or human can point to the action having happened; mark_expired(id) closes the record when the deadline passed. Run a periodic check_expirations sweep that auto-expires open commitments past their deadline. Surface open commitments in the agent's working context so it can act on them.

What this pattern forbids. The agent cannot mark its own commitments as followed-through in the same turn that produced them; the audit must run as a separate pass against an independent record of action.

And the patterns that stand alongside it, or against it —

complementsDecision Log★★— Persist the agent's reasoning trace alongside its actions so post-hoc review can explain why.
complementsPreoccupation Tracking★— Maintain a small set of mid-term, affect-tagged concerns that persist across days and surface in every prompt, distinct from the single-item working focus and from long-term insights.
complementsReflection★★— Have the model review its own output and produce a revised version in one or more passes.
alternative-toTodo-List-Driven Autonomous Agent★— Have the agent author a plan file (e.g. todo.md) early in the run, tick items as it completes them, and re-inject the remaining plan into context; the file is durable plan and working memory.
complementsBDI Agent★★— Agent maintains explicit Beliefs about the world, Desires (goals), and Intentions (committed plans), and reasons by reconciling the three.
complementsJoint Commitment Team·— A team of agents adopts a shared goal plus the meta-commitment that each member will notify the others as soon as it believes the goal is achieved, impossible, or no longer relevant.
alternative-toDeontic Token Delegation·— Reify obligations, permissions, and prohibitions as transferable deontic tokens that agents pass along the delegation chain with provenance, so duty and accountability transfer with the work, not only the credentials to perform it.

Neighbourhood

Click any neighbour to follow the language. Scroll to zoom, drag to pan.

Used in frameworks

Sparrot
first-class75 patternsDomain Agents· experimental
A cheap extraction pass after chat turns records stated intents as typed commitment records (open / followed-through / expired) and flags overdue ones via a maintenance hook, surf…

References

Provenance

Source: patterns/commitment-tracking.md on GitHub · commit ad774f5 · view history
Added to catalog: 2026-05-21
Last updated: 2026-05-21
Contribute: open an issue or PR at github.com/agentpatternscatalog/patterns.