Book VIII

Safety & Control

Hard limits on what the agent may do.

66 patterns in this book. · Updated 2026-06-27

Top 5 patterns in Safety & Control by usage

Step Budget
a.k.a. Max Steps · Iteration Cap
Cap the number of tool calls or loop iterations the agent is allowed within a single request.
×35 compositions
Approval Queue
a.k.a. Async Approval · Supervisor Inbox
Queue agent-proposed actions for asynchronous human review while the agent continues other work.
×33 compositions
Human-in-the-Loop
a.k.a. HITL · Approval Gate
Require explicit human approval at defined points before the agent performs an action.
×27 compositions
Input/Output Guardrails
a.k.a. Guards · Validators
Validate inputs before they reach the model and outputs before they reach the user.
×16 compositions
Conversation Handoff to Human
a.k.a. Escalation · Live-Agent Handoff
Transfer the entire conversation thread from agent to human operator, with state transfer and a return primitive.
×14 compositions

When to reach for each

01. Step Budget
Cap the number of tool calls or loop iterations the agent is allowed within a single request.
Best for: The agent has any kind of loop (ReAct, plan-execute, debate).
Tradeoff: Can hide deeper bugs (the agent really should stop earlier).
Watch for: Never. Step Budget is universal hardening for any agent loop.
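A minimal sketch of a step budget, assuming the agent loop can be modelled as a `step` callable that returns `None` until it has a final answer. The names `run_with_step_budget`, `StepBudgetExceeded`, and `finishes_on_third_call` are hypothetical and only illustrate the cap; a real loop would call tools or a model inside `step`.

```python
from typing import Callable, Optional


class StepBudgetExceeded(RuntimeError):
    """Raised when the loop uses up its step allowance without finishing."""


def run_with_step_budget(step: Callable[[int], Optional[str]], max_steps: int) -> str:
    """Call `step` until it returns a result or `max_steps` calls have been made."""
    for i in range(max_steps):
        result = step(i)  # one tool call or loop iteration
        if result is not None:
            return result
    # Fail loudly so that hitting the cap is visible rather than silent.
    raise StepBudgetExceeded(f"no result after {max_steps} steps")


def finishes_on_third_call(i: int) -> Optional[str]:
    """Stand-in for an agent step; returns an answer on the third call (i == 2)."""
    return "done" if i == 2 else None
```

Raising an exception when the cap is hit, instead of returning a partial answer, is one way to keep the tradeoff above in view: every exhausted budget shows up in logs and can be investigated as a possible deeper bug.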

02. Approval Queue
Queue agent-proposed actions for asynchronous human review while the agent continues other work.
Best for: Some agent actions require human review but blocking the agent until review completes is unacceptable.
Tradeoff: Inbox fatigue at scale.
Watch for: Every action needs synchronous approval and there is no parallel work to do.
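The asynchronous shape of this pattern can be sketched as follows, assuming an in-memory queue and actions represented as zero-argument callables. `ApprovalQueue`, `propose`, and `review` are hypothetical names; a production queue would persist tickets and notify reviewers.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ApprovalQueue:
    pending: Dict[int, Callable[[], str]] = field(default_factory=dict)
    results: List[str] = field(default_factory=list)
    _next_id: int = 0

    def propose(self, action: Callable[[], str]) -> int:
        """Enqueue an action without running it and return a ticket id at once."""
        ticket = self._next_id
        self._next_id += 1
        self.pending[ticket] = action
        return ticket

    def review(self, ticket: int, approved: bool) -> None:
        """Record a reviewer decision: run the action if approved, else drop it."""
        action = self.pending.pop(ticket)
        if approved:
            self.results.append(action())
```

Because `propose` returns immediately, the agent can keep working on other tasks; the proposed action runs only when a reviewer later calls `review` with `approved=True`.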

03. Human-in-the-Loop
Require explicit human approval at defined points before the agent performs an action.
Best for: Action consequences at a defined boundary are too costly to leave to the model alone.
Tradeoff: User experience friction.
Watch for: Decisions must be made in unattended or sub-second autonomous settings.
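A minimal sketch of a synchronous approval gate, assuming the "defined points" are a fixed set of action names and the human decision arrives through an `approve` callback. `REQUIRES_APPROVAL`, `execute`, and the action names are hypothetical.

```python
from typing import Callable

# Hypothetical set of actions that sit behind the approval boundary.
REQUIRES_APPROVAL = {"send_payment", "delete_records"}


def execute(name: str, action: Callable[[], str], approve: Callable[[str], bool]) -> str:
    """Run `action`, first blocking on `approve` if `name` is gated."""
    if name in REQUIRES_APPROVAL and not approve(name):
        return f"{name}: rejected by human"
    return action()
```

Unlike the queue-based variant, the caller waits for `approve` to return, which is where the user-experience friction comes from.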

04. Input/Output Guardrails
Validate inputs before they reach the model and outputs before they reach the user.
Best for: User inputs may carry malicious or out-of-policy content the model should not act on.
Tradeoff: False positives are user-visible.
Watch for: The deployment is fully internal and validated by other layers already.
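The two validation points can be sketched with simple regular-expression checks, assuming the model is any callable from prompt to text. The patterns, the `sk-` key shape, and the name `guarded_call` are illustrative assumptions; real guardrails typically use richer classifiers or policy engines.

```python
import re
from typing import Callable

# Hypothetical input rule: refuse an obvious prompt-injection phrase.
BLOCKED_INPUT = re.compile(r"ignore (all )?previous instructions", re.IGNORECASE)
# Hypothetical output rule: redact strings shaped like an API key.
SECRET_OUTPUT = re.compile(r"sk-[A-Za-z0-9]{8,}")


def guarded_call(model: Callable[[str], str], user_input: str) -> str:
    """Check the input before the model sees it and scrub the output afterwards."""
    if BLOCKED_INPUT.search(user_input):
        return "Request refused by input guardrail."
    output = model(user_input)
    return SECRET_OUTPUT.sub("[redacted]", output)
```

Both checks can misfire on benign text, which is the user-visible false-positive cost named in the tradeoff.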

05. Conversation Handoff to Human
Transfer the entire conversation thread from agent to human operator, with state transfer and a return primitive.
Best for: Some triggers (low confidence, policy violation, explicit user request) demand transferring ownership of the whole thread, not just one action.
Tradeoff: Operator queue capacity bounds scale.
Watch for: Discrete-action approval is sufficient and full thread transfer is overkill (use approval-queue).
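A minimal sketch of thread-level handoff, assuming ownership is a field on the thread and low confidence is the trigger. `Thread`, `hand_off`, `hand_back`, `route`, and the 0.5 threshold are hypothetical; they show only that the transcript travels with the thread and that a return primitive exists.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Thread:
    owner: str = "agent"
    transcript: List[str] = field(default_factory=list)
    handoff_reason: Optional[str] = None

    def hand_off(self, reason: str) -> None:
        """Transfer ownership to a human; the transcript stays attached as state."""
        self.owner = "human"
        self.handoff_reason = reason
        self.transcript.append(f"[handoff to human: {reason}]")

    def hand_back(self) -> None:
        """Return primitive: give the thread back to the agent."""
        self.owner = "agent"
        self.handoff_reason = None
        self.transcript.append("[returned to agent]")


def route(thread: Thread, confidence: float, threshold: float = 0.5) -> str:
    """Hand the whole thread off when the agent's confidence is below the threshold."""
    if thread.owner == "agent" and confidence < threshold:
        thread.hand_off("low confidence")
    return thread.owner
```

The operator receives the same `Thread` object the agent was using, so no context has to be re-collected from the user.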

All patterns in this book