← All booksBook IX

Routing & Composition

Sending requests to the right specialist.

23 patterns in this book. · Updated 2026-06-14

Top 5 patterns in Routing & Composition by usage

↓ download as png

AGENT PATTERNS · BOOK IX · ROUTING & COMPOSITION

Top 5 patterns by usage

agentpatternscatalog.org

Multi-Model Routing
a.k.a. Cascade Routing · Cheap-First Routing
Send each request to the cheapest model that can handle it well.
×24 compositions
Fallback Chain
a.k.a. Cascade Fallback · Try-Then-Try-Else
Try a primary handler; on failure or low confidence, fall through to a sequence of fallback handlers.
×13 compositions
Circuit Breaker
a.k.a. Failure Trip · Rate-Limit Trip
Stop calling a failing dependency for a cooldown period after error rates exceed a threshold.
×4 compositions
Pipes and Filters
a.k.a. Pipeline · Streaming Pipeline
Compose stream-shaped processing as a chain of small filters connected by pipes.
×4 compositions
Automatic Workflow Search
a.k.a. AFlow · Workflow Synthesis
Treat the agent's workflow (a graph of LLM-invoking nodes) as an artefact to search; use Monte Carlo Tree Search guided by an eval benchmar…
×4 compositions

When to reach for each

01. Multi-Model Routing Send each request to the cheapest model that can handle it well. Best for: Cost and quality goals diverge across request types. Tradeoff: Two-model debug surface. Watch for: A single model already meets the price-performance target.

02. Fallback Chain Try a primary handler; on failure or low confidence, fall through to a sequence of fallback handlers. Best for: Single-handler failure would cascade to the user as an outage. Tradeoff: Cumulative latency on full cascade. Watch for: Only one handler exists and there is nothing to fall back to.

03. Circuit Breaker Stop calling a failing dependency for a cooldown period after error rates exceed a threshold. Best for: A dependency fails often enough that hammering it wastes cost or blocks legitimate traffic. Tradeoff: False trips degrade availability when the error was transient. Watch for: Failures are correlated across all dependencies and there is no useful fallback to route to.

04. Pipes and Filters Compose stream-shaped processing as a chain of small filters connected by pipes. Best for: A transformation can be decomposed into small filters with single responsibilities. Tradeoff: Pipeline visibility: hard to see end-to-end behaviour. Watch for: The transformation is small enough that a single function is clearer.

05. Automatic Workflow Search Treat the agent's workflow (a graph of LLM-invoking nodes) as an artefact to search; use Monte Carlo Tree Search guided by an eval benchmark to discover the best workflow, then deploy it. Best for: You have a stable eval benchmark that can score full workflows end-to-end. Tradeoff: Eval set quality bounds discovered workflow quality. Watch for: No reliable eval exists to guide the search.

Routing & Composition

Top 5 patterns by usage

Multi-Model Routing

Fallback Chain

Circuit Breaker

Pipes and Filters

Automatic Workflow Search

When to reach for each

All patterns in this book

Multi-Model Routing

Fallback Chain

Circuit Breaker

Pipes and Filters

Automatic Workflow Search

Provider Fallback

Routing

Open-Weight Cascade

Provider-String Routing

Graceful Degradation

Agent Persona Profile

Complexity-Based Routing

Hybrid Symbolic-Neural Routing

Mixture of Experts Routing

MRKL Systems (Modular Neuro-Symbolic)

Parallel Tool Calls

Parallelization

Prompt Chaining

Behavior-Space Architecture

BPMN/DMN Deterministic Shell Around Agent

Dynamic Scaffolding

Trust and Reputation Routing

SLA-Aware Triage Scoring