DeerFlow 2.0 (SuperAgent harness)

Type: full-code · Vendor: ByteDance · Language: Python, TypeScript · License: MIT · Status: active · Status in practice: emerging · First released: 2026-02

Links: homepage docs repo

Run long-horizon tasks that research, code, and create by giving a single lead agent the ability to dynamically spawn bounded, context-isolated subagents that execute code and tools inside Docker sandboxes and then synthesise their structured results.

Description. DeerFlow 2.0 is a ground-up rewrite of ByteDance's DeerFlow that shares no code with the 1.x deep-research graph and became the active line after its launch around late February 2026. Rather than a fixed LangGraph research workflow, it is a general open-source SuperAgent harness: a lead agent receives the request, decomposes it, and dynamically spawns subagents on the fly, each with its own scoped context, tools, and termination conditions. Subagents run in parallel when possible and report structured results back to the lead agent, which synthesises them into the final deliverable. The roster is not fixed: built-in subagents (a general-purpose agent and a bash agent) sit alongside user-defined custom agents declared in config.yaml, and concurrency is bounded by a SubagentLimitMiddleware. Each subagent executes inside an isolated sandbox — the AioSandboxProvider gives it a real filesystem and bash terminal in a Docker container (local, Docker, or Kubernetes-provisioned), while the LocalSandboxProvider is explicitly gated as not a secure boundary. Capabilities are extended through Agent Skills (SKILL.md capability modules, public or custom, loaded progressively) and through tools that include web search, web fetch, file operations, and bash, plus MCP servers exposed as deferred tools the agent must look up via a tool_search tool before calling. A persistent cross-session memory subsystem accumulates the user's profile and preferences, and a message gateway fronts the harness with LangGraph-compatible HTTP routes. The lead agent itself runs as a LangChain agent wrapped in a middleware chain (loop detection, summarisation, todo tracking, subagent limiting, sandbox auditing, and more). The original deep-research graph is catalogued separately as DeerFlow 1.x and is maintained on the `main-1.x` branch.

Agent loop shape. A lead agent runs a LangChain agent loop wrapped in a middleware chain; on each step it may call the `task` tool to delegate work to subagents (general-purpose, bash, or custom skill-scoped), each of which runs its own bounded ReAct-style loop inside an isolated Docker/AIO sandbox and returns a structured result. Concurrent subagent spawns are truncated to a clamped maximum, skills and deferred MCP tools are loaded lazily, and the lead agent synthesises subagent results into the final output while a cross-session memory subsystem persists profile and preferences across runs.

Primary use cases

long-horizon tasks (minutes to hours) that mix research, coding, and content creation
autonomous agents that need a real filesystem, bash terminal, and safe code execution
dynamic task decomposition where the lead agent spawns parallel subagents on demand
skill- and MCP-extensible super agents with persistent cross-session memory

flowchart TD GW["Message gateway: /api/langgraph/*"] --> LA["Lead agent: decompose and synthesize"] SK[("Skills: SKILL.md, loaded progressively")] -.-> LA MEM[("Cross-session memory: profile and preferences")] -.-> LA LA -->|"task tool, bounded by max_concurrent"| SA1["Subagent: general-purpose"] LA -->|"task tool"| SA2["Subagent: bash"] LA -->|"task tool"| SAN["Subagent: custom / skill-scoped"] SA1 --> SB["Sandbox: Docker/AIO isolated FS and bash"] SA2 --> SB SAN --> SB SA1 -->|"structured result"| LA SA2 -->|"structured result"| LA SAN -->|"structured result"| LA LA --> OUT["Finished deliverable"]

Key concepts

Orchestrator-Workers → orchestrator-workers — A lead agent decomposes the request and dispatches sub-tasks to subagents via a `task` tool, then synthesises their structured results into the finished deliverable — the harness's central orchestration shape.
Subagent Isolation → subagent-isolation — Each subagent is spawned with its own scoped context, tool set, and termination conditions, and cannot see the parent or sibling contexts; the built-in general-purpose agent is recommended precisely when a task benefits from isolated context management.
Parallel Fan-Out / Gather → parallel-fan-out-gather — Subagents run in parallel when possible, report back structured results, and the lead agent gathers and synthesises them; a single model response can emit several parallel `task` calls at once.
Agent Skills → agent-skills — Capabilities are packaged as Agent Skills — SKILL.md modules (public/built-in or user-authored custom) that define a workflow and supporting resources, loaded progressively so only the skills a task needs enter context.
Sandbox Isolation → sandbox-isolation — Subagents execute inside isolated sandboxes: the AioSandboxProvider provisions Docker containers (local or Kubernetes-backed) with a real filesystem and bash, while the LocalSandboxProvider is explicitly gated off for host bash because it is not a secure boundary.

DeerFlow 2.0 (SuperAgent harness)

Neighbourhood

Anti-patterns avoided

Alternatives & relatives

Listed as alternative by (1)

References

Provenance