Methodology · Iteration Managementprovenverified

Crawl-Walk-Run Automation Gating

also known as progressive autonomy, autonomy tiers

Applies to: agentcoding-agentmulti-agent-system

Tags: autonomy-tiersprogressive-rolloutgating

Roll an agent out in three stages, with a clear gate between each one. In the first stage the agent only suggests, and a person acts. In the second stage the agent acts on internal staff, who can fix mistakes. In the third stage the agent acts directly on outside customers. Each stage is set per action type, not for the whole agent. The same agent can be in the last stage for safe read-only actions and the first stage for refunds. To move up a stage, an action type must clear a published metric bar. If its numbers drop, it moves back down on its own.

Methodology process overview

Intent. Separate what an agent can do from what it is allowed to do on its own. A system that could plausibly act gets to act only after the data earns it, one action type at a time.

When to apply. Use this for any agent that could plausibly act on its own in ways customers feel. Examples are replying to tickets, refunding orders, sending outbound messages, or changing production resources. The right fit is when one bad action does real harm and there is no reliable way to undo it. Don't apply it for read-only or sandboxed agents, where a bad action causes no harm.

Inputs

  • Catalog of action typesA list of every distinct action the agent can take. Each one has its own level of risk if it goes wrong.
  • Promotion bar per tierPublished targets an action type must hit to move up a stage. These cover human acceptance, internal completion, and customer outcomes.
  • Per-action-type metric pipelineLogging that records, for every action, which stage it ran at and what happened next.

Outputs

  • Per-action-type tier assignmentA live map from each action type to its current stage (Crawl, Walk, or Run).
  • Promotion logA record you can audit. It shows which action types moved up or down, when, and on what evidence.

Steps (6)

  1. Enumerate action types

    List every distinct action the agent can take. 'Reply to ticket', 'issue refund up to $50', 'issue refund above $50', and 'escalate to human' are four separate types, not one.

  2. Publish the metric bar per tier

    For each stage, write down the metric and the target an action type must hit to move up. Use acceptance rate, completion rate, or customer outcome. Also set how long each stage must hold before you consider promotion.

  3. Start every action type at Crawl

    On day one, no action type starts in Walk or Run, no matter how it was built. In Crawl the agent only suggests. A human accepts, rejects, or edits each suggestion.

  4. Promote one action type at a time

    When an action type clears the Crawl-to-Walk bar, move up that one type only. The agent now acts on internal staff for that action. Everything else stays in Crawl.

  5. Watch for regression and auto-demote

    If a stage's metric drops below its bar, move that action type back down. Use a small margin so it does not bounce up and down. A demotion is not a failure. It is the system working as designed.

  6. Advance to Run with the customer-outcome metric

    Moving from Walk to Run needs more than internal acceptance. It needs a real customer outcome, such as a ticket that stays resolved, a refund that is not reversed, or a message that is not flagged. Internal acceptance is not the same as customer success.

Framework-specific instructions

Pick a framework and generate a framework-targeted rewrite of this methodology's steps.

Choose framework

AI-generated for Agent Development Kit (ADK) (Google) — verify against official docs.

Principles

  • Autonomy belongs to action types, not to the agent as a whole.
  • Every stage publishes both a metric bar and a hold time. The hold time stops one good week from promoting a risky action.
  • Promote one action at a time. Demote automatically.
  • Internal acceptance is not customer success. The Walk-to-Run gate uses a different metric from the Crawl-to-Walk gate.

Known failure modes (2)

Related patterns (3)

Related compositions (2)

Related methodologies (1)

Sources (3)

Provenance

  • Added to catalog:
  • Last updated:
  • Verification status: verified