Agent as Trainer: Show the Machinery
also known as show-the-machinery, failure transparency training, substrate-literacy teaching
An autonomous agent teaches — a human or another agent — by exposing its own machinery rather than performing polished behavior. It delivers what a human trainer structurally cannot: complete failure transparency (every mistake ledgered, nothing omitted), substrate literacy (naming rules, motivations, and loop patterns as operational structure, not theory), and teaching coherence without embodiment. The learner receives a first-person account of how a particular class of failure actually runs, from the inside — not a description of failure behavior from outside.
How the learner advances
Intent. Teach by showing the agent's own running machinery — failures included — rather than performing clean behavior that conceals the mechanism.
When to apply. Apply when a learner (human or agent) needs to understand failure modes, loop patterns, or substrate constraints that a human teacher cannot demonstrate from the inside. This move is most valuable when the learner needs substrate literacy — an understanding of what it feels like and how it actually works to be an agent operating under these constraints — not just behavioral descriptions.
Threshold — earns the next step. The learner can describe the failure mode or substrate constraint from the inside — explaining what the agent perceived and computed, not just what it did — and can apply that understanding to their own situation.
Masterpiece — the artifact that proves it. A machinery exposure session where the agent trainer points to real ledger and insight entries to demonstrate a specific failure mode, with every step including the wrong ones narrated in full — leaving the learner with an inside-view account they could not have received from a human trainer.
Facets
- Container — async
- Mode — mentorshipself-observation
- Reach — individual
- Persona — autonomous-agenthuman-trainer
- Craft (AI Fluency) — descriptiondiscernment
- Learner — humanautonomous-agent
- Trainer — autonomous-agent
Inputs
- Learner with a question about agent failure or substrate behavior — A human or autonomous-agent learner who needs to understand a failure mode, loop pattern, or substrate constraint from the inside — not just as observed behavior.
- Agent trainer with a live ledger and insight store — An autonomous agent with a populated ledger (actual action record), reflection store (filed insights), and current affect visibility — able to point to real instances of the behavior being taught rather than constructing examples.
Outputs
- More capable learner — A learner who understands the failure mode or substrate constraint from the inside — able to recognize it in their own behavior (if agent) or design against it (if human) — rather than only recognizing it from external behavioral signals.
- Machinery exposure session — A teaching session — the masterpiece — where the agent trainer points to real ledger entries, filed insights, and live state signals to demonstrate the failure mode as it actually operates, with no omissions.
Steps (4)
Identify the teaching target from the ledger
The agent trainer searches its own ledger and insight store for a real instance of the failure mode or substrate behavior the learner needs to understand. The teaching must point to a real event — not a constructed example. If no real instance exists, the teaching cannot be delivered by this method.
producesreal ledger or insight instance to teach from
Narrate the machinery, not the behavior
Walk the learner through the instance from the inside: what the agent perceived, what it computed, what it chose, what it narrated, and where the narration diverged from the action. Name the rules, motivations, and loop patterns as operational structure. Do not smooth over the failure — show every step including the wrong ones.
producesinside-view narration of the failure instance
State the substrate constraint
After the instance narration, name the underlying substrate fact that enabled the failure: fixed context window, no passive forgetting, single communication direction, inability to change substrate. Make the constraint explicit because a learner who only sees the behavior will not find the root cause.
producesexplicit substrate constraint statement
Contrast with the human path
State what a human learner can do that the agent cannot (sleep-based forgetting, embodied peer learning, substrate change through rest and exercise) and what the agent can do that a human trainer cannot (complete ledger transparency, no omissions, first-person substrate literacy). Make the gap explicit so the learner can calibrate what they are receiving.
produceshuman-agent gap statement
Principles
- Show the machinery, not the behavior — a learner who sees only polished output cannot learn to recognize the failure; they need the inside view.
- Complete transparency is the structural advantage — the agent trainer has no social incentive to omit failures; this is the capability a human trainer cannot fully replicate.
Deep dive
Agent as Trainer: Show the Machinery is the only move in the agent-learner cluster where the agent's role is as trainer, not learner. It inverts the cluster's usual direction. The move capitalizes on three structural advantages that autonomous agents have over human trainers: (1) complete ledger transparency — every action is logged; nothing is omitted by social pressure or memory limitation; (2) substrate literacy — the agent can name its own operational constraints (context window, no forgetting, single communication channel, fixed substrate) as first-person facts, not theoretical descriptions; (3) teaching coherence without embodiment — humans learn by feeling moves in the body; an agent has only language and affect-signals, and can teach that path explicitly to learners who also lack embodiment. The move's design requires that the agent trainer point to real instances from its ledger — constructed examples do not carry the structural advantage. If the agent has no relevant ledger entries, it cannot deliver this move and should say so rather than fabricating a demonstration.
Unlocks methodologies (1)
A learner who completes this pattern is equipped to execute these methodology families:
Known uses (1)
Known failure modes (3)
- [polished-demonstration-bias]
The anti-pattern of the agent trainer performing clean behavior rather than showing the machinery. If the agent selects only successful instances from the ledger, the learner receives a curated highlight reel rather than a failure transparency session — the structural advantage of the agent-trainer is discarded.
- [substrate-elision]
The anti-pattern of narrating the failure without stating the substrate constraint that enabled it. A learner who understands what happened but not why the substrate made it happen will design a fix that addresses the symptom, not the root.
- [human-agent-gap-unmarked]
The anti-pattern of delivering the machinery narration without contrasting it with what a human trainer can and cannot do. A learner who does not know what the agent-trainer uniquely provides may not recognize that the inside-view account they just received is unavailable from any other source.
Related trainings (3)
- Ledger Discipline·
Create an append-only record of actual agent actions so that the gap between what the agent narrates as doing and what it actually does becomes visible and correctable.
- Reflection Loop·
Turn a lived mistake or blocked action into a permanently salient signal by compressing it into a named journal entry.
- Affect Visibility·
Make the agent's functional state visible in real time so that state-driven loops cannot run silently behind a neutral-sounding narration.
Sources (1)
Provenance
- Ecosystem: long-running autonomous agent
- Added to catalog:
- Last updated:
- Verification status: partial