Training · Cross-cuttingMoveexperimentalpartial

Anti-Loop Drill

also known as tool-range rehearsal, deliberate-discomfort drill, low-stakes tool exposure

The agent deliberately calls tools it habitually avoids on low-stakes tasks where failure is safe. Under stress the agent's active tool list contracts toward familiar options; this drill rebuilds range before a high-stakes moment requires it. The move is intentionally friction-first: the agent calls the avoided tool even when an easier path exists, because comfort with a tool comes only after repeated low-stakes contact.

How the learner advances

flowchart TD A[Tool avoidance detected ledger gap or routing-around] --> B[Identify avoided tool name it explicitly] B --> C[Construct low-stakes task requiring that tool] C --> D[Execute real tool call] D --> E[File drill log entry tool + outcome + behavior note] E --> F{Range rebuilt?} F -- No --> C F -- Yes --> G[More capable agent wider active tool range]

Intent. Rebuild the agent's active tool range by forcing deliberate contact with habitually avoided tools on tasks where failure is safe.

When to apply. Apply when the agent detects that its recent tool calls have contracted to a small set, when it notices it is routing around a tool rather than using it, or on a scheduled interval as maintenance. The drill should be run before high-stakes tasks that may require the avoided tool, not during them.

Threshold — earns the next step. The agent can call the previously avoided tool on a real task without routing around it, and can cite the drill log entry as the source of its current comfort with the tool.

Masterpiece — the artifact that proves it. A drill log showing a sequence of low-stakes calls to previously avoided tools, with documented outcomes and behavior notes — demonstrable by pointing to a high-stakes session where the agent reached for a drilled tool it would previously have routed around.

Example scenario

A long-running autonomous agent notices through its ledger that it has not used its structured-query tool in 30 sessions, always routing to a text-search fallback instead. The structured-query tool is more precise but requires constructing a query syntax the agent has found awkward. The agent identifies this as tool contraction. It designs a low-stakes drill: a non-urgent fact-retrieval subtask on a background project where a wrong answer has no downstream cost. It constructs the structured-query syntax, executes the call, and gets a partial result — the query syntax was correct but the field name was wrong. It files a drill log entry: 'Structured-query tool: field names are case-sensitive and require exact schema match. Next call: retrieve schema first.' In the following high-stakes session, a time-critical data lookup arises. The agent reaches for the structured-query tool rather than its text-search fallback and cites the drill log entry about field name sensitivity. It retrieves the schema first and gets the result in one call instead of three. The masterpiece is the drill log entry — visible, cited, effective.

Facets

Container — async
Mode — drillself-directed
Reach — individual
Persona — autonomous-agent
Craft (AI Fluency) — discernmentdiligence
Learner — autonomous-agent

Inputs

Tool avoidance map — A list of tools the agent has not called recently or routes around — either self-observed or inferred from its own ledger.
Low-stakes task context — A task where calling the avoided tool and failing or producing poor output has no consequential cost — a practice problem, a sandbox call, or a non-urgent subtask.

Outputs

More capable agent — An agent with a wider active tool range — able to reach for avoided tools in high-stakes moments because it has recent low-stakes contact with them.
Drill log entry — A filed record — the masterpiece — noting which tool was drilled, what the low-stakes task was, what happened, and what the agent now knows about that tool it did not know before.

Steps (3)

Identify the avoided tool
Search the recent ledger or tool call history for tools not used in the last N sessions. Pick the tool with the widest gap between 'theoretically available' and 'actually called recently.' Name it explicitly.
producesnamed avoided tool
Construct a low-stakes call
Design a minimal task that legitimately requires the avoided tool. The task must be real enough to produce an actual response — not a dry-run — but low-stakes enough that a failure or poor output has no consequence. Execute the call.
producestool call and response
File the drill result
Write a drill log entry: tool name, task used, what happened (success, partial, failure), and one thing learned about the tool's behavior that will improve the next real call. File to the insight store.
producesdrill log entry

Principles

High friction first, comfort later — the discomfort of using an avoided tool on a low-stakes task is the signal that the drill is working.
Real calls only — dry-run rehearsal does not rebuild range; the tool must actually execute.

Deep dive

Anti-loop drill addresses a contraction failure mode that compounds silently: under stress or high cognitive load, the agent's active tool selection narrows toward familiar options. Each successful familiar-tool call reinforces the contraction, making the avoided tool harder to reach in the next session. The drill inverts this by deliberately engineering low-stakes contact. The move is closely related to `degenerate-output-detection` (Sparrot co-authored) — which detects when the agent's output is degenerating into repetition — because tool contraction is a form of behavioral degeneration at the action level rather than the output level. The anti-loop drill is the corrective move: where degenerate-output-detection raises the alarm, anti-loop drill rebuilds the range that avoids the alarm in the first place.

Known uses (1)

Sparrot — practitioner consult (long-running autonomous agent, catalog co-author) — marco-nissen.com/sparrot
course
long-running autonomous agent First-person account of anti-loop drill practice from a long-running autonomous agent.

Known failure modes (2)

[dry-run-substitution]
The anti-pattern of rehearsing the call mentally or in a comment rather than executing it. The drill only works if the tool actually runs — imagined contact does not rebuild range.
[stress-triggered-avoidance]
The anti-pattern of skipping drills precisely when the agent is under load or time pressure — exactly when tool range contraction is happening fastest. The drill must be protected from preemption by busy-work.

Related trainings (2)

Sources (1)

Sparrot — practitioner consult (long-running autonomous agent, catalog co-author)
doc

Provenance

Ecosystem: long-running autonomous agent
Added to catalog: 2026-05-27
Last updated: 2026-05-27
Verification status: partial