Full-Code · Orchestration Frameworksactive

Letta

also known as MemGPT

Type: full-code · Vendor: Letta · Language: Python · License: Apache-2.0 · Status: active · Status in practice: mature

Links: homepage docs repo

Build stateful LLM agents that remember, learn, and improve over time by self-managing a tiered memory (in-context blocks plus archival/recall stores) via tool calls.

Description. Letta (formerly MemGPT) is the open-source platform created by the authors of the MemGPT paper for building stateful agents with advanced memory. Letta agents process messages through a tool-calling loop in which the model can read and edit its own memory: in-context memory blocks live inside the prompt, archival memory is a vector store queried on demand, and recall memory logs full conversational history. All agent state — memory blocks, messages, reasoning, and tool calls — is persisted in a database so nothing is lost even when content is evicted from the context window. Letta exposes its agents as a stateful REST API.

Agent loop shape. Tool-calling loop on top of MemGPT-style virtual context. Memory blocks are always-visible XML-like sections prepended to the prompt; the agent can call memory tools to edit them, archive content, or search recall/archival memory. All state — memory, messages, reasoning, tool calls — is written to a database after each step, so the agent persists across server restarts and is addressable through the Letta REST API.

Primary use cases

long-running personal/companion agents that retain user context across sessions
research agents that accumulate facts in archival memory
stateful customer-support agents exposed as a REST service
MemGPT-style virtual-context experiments and continual learning

flowchart TD USER[User message] --> API[POST agents/messages/create] API --> AGENT[Letta agent loop] AGENT --> CTX{Prompt = memory blocks + recent msgs} AGENT -->|tool call| TOOLS[Tools memory edit / search / user-defined] TOOLS --> CORE[Memory blocks always visible] TOOLS --> ARCH[(Archival memory vector DB)] TOOLS --> RECALL[(Recall memory conversation log)] AGENT --> RESP[Assistant / Reasoning / ToolCall / ToolReturn messages] RESP --> DB[(Database all state persisted)] DB -.resume.-> AGENT ARCH -.semantic search.-> AGENT RECALL -.paged in.-> AGENT

Key concepts

Memory blocks (core memory) → cross-session-memory (docs) — Structured sections of the prompt that persist across all interactions and are always visible to the agent; the agent edits them via memory tools.
Archival memory → vector-memory (docs) — Vector-DB store of long-term facts and knowledge; queried on demand via tools because contents cannot be pinned to the context window.
Recall memory (docs) — Database table that logs the agent's full conversational history; surfaces older messages when needed.
MemGPT virtual context → memgpt-paging (docs) — OS-inspired paging mechanism from the MemGPT paper: the agent self-manages what lives in the limited context window via tool calls.
Letta REST API / stateful messages.create (docs) — Agents are addressed as stateful resources; client SDKs hit a REST endpoint that processes one user message and returns assistant, reasoning, tool-call, and tool-return messages.

Patterns this full-code implements —

References

Provenance

Last analyzed: 2026-05-24
Last updated: 2026-06-14
Verification status: verified