Planning & Control Flow

ReWOO

Plan a complete dependency DAG with placeholder variables before any tool runs, then execute and substitute observations into the plan.

Problem

In a ReAct loop (reason-act-observe), every tool observation is fed back into the planner's prompt for the next reasoning turn. Token cost therefore grows roughly with the square of the step count, because each turn carries the trace of all the previous turns. On an eight-step task the planner re-reads its own scratch reasoning and all prior observations seven times. Most of those re-reads do not change the plan — the structure was knowable up front — so the team is paying for re-prompting that produces no new decisions.

Solution

Three roles. Planner emits a DAG with steps `t1 = ToolA(x); t2 = ToolB(#t1)` using variable references. Worker executes each tool in dependency order. Solver reads the resolved trace and produces the final answer. The planner never sees observations.

When to use

Most planning steps do not depend on early observations and can be planned upfront.
ReAct-style observation re-injection is the dominant token cost.
Tools have stable signatures so the planner can reference outputs by variable.

Open the full interactive page →

Diagram, neighbourhood map, code examples, related patterns and full provenance.

Problem

Solution

When to use

Related