Methodology · Agent Constructionemergingverified

Four-Tier Agent Memory Construction

also known as conversational-semantic-episodic-procedural memory build, CSEP memory stack

Applies to: agent

Tags: memoryfour-tierconversational-semantic-episodic-proceduralcompression

Build agent memory as four parts that work together. Conversational memory holds the recent turns. Semantic memory holds facts and their embeddings. Episodic memory holds traces of past interactions. Procedural memory holds learned skills and routines. Each part has its own rules for what to write, how to look things up, and how to shrink it when it gets too big. This rejects the shortcut of using one vector store for everything. It makes the team decide, for each part, what is stored, how it is retrieved, and how it is compressed when it fills up.

Methodology process overview

flowchart TD budget[Memory budget] --> s1[Build conversational tier] recall[Recall scenarios] --> s1 s1 --> s2[Build semantic tier] s2 --> s3[Build episodic tier] s3 --> s4[Build procedural tier] s4 --> s5[Define retrieval orchestration across tiers] s5 --> s6[Define compression and forgetting policies] s6 --> out1[Four-tier memory architecture] s6 --> out2[Per-tier policies] s1 -.-> conv[(Conversational)] s2 -.-> sem[(Semantic)] s3 -.-> epi[(Episodic)] s4 -.-> proc[(Procedural)]

Intent. Replace 'agent memory is one vector store' with four clear parts, conversational, semantic, episodic, and procedural, each with its own rules.

When to apply. Use this when you design memory for any agent that must remember things beyond a single conversation: assistants, coding agents, and long-running ops agents. It helps most when a team has hit recall problems with one plain vector store. Don't apply it for single-turn agents and stateless tools. Skip it too when the model's context window already covers the memory needs and you do not need lookup at all.

Example scenario

A team building a long-running personal-finance assistant saw users complain that it 'forgot' simple facts after a few weeks. At the same time it kept dragging in context that did not matter. They had used a single pgvector index for everything. They rebuilt memory as the four parts. Inputs: a memory budget of 8k tokens per turn and about 1GB of storage per user, plus recall scenarios pulled from real user logs (such as 'what's my emergency-fund target?', 'how did we handle my Q2 bonus last year?', and 'do my usual end-of-month sweep'). Conversational part: a sliding window that keeps the last six turns plus a rolling summary. Semantic part: extracted facts such as target_emergency_fund=$15000, stored in a vector index for similarity lookup. Episodic part: per-interaction transcripts, timestamped and tagged, searchable by date and topic. Procedural part: 'end-of-month sweep' stored as a routine with steps the agent could replay. Lookup routing: fact questions hit semantic first, 'last time' questions hit episodic, recurring-task questions hit procedural. Compression: episodic memories shrank to summaries after 90 days, and a routine was promoted only after three successful runs. The forgetting rule was the surprise win. They chose to drop stale facts, such as the payroll cadence from the user's old job, instead of letting them spoil lookups. Recall-precision on the eval set went from 0.62 to 0.89. The 'agent dragged in random old context' complaints stopped.

Inputs

Memory budget — The limits for each part: how many tokens, how much storage, and how much delay you can afford.
Recall scenarios — The questions or moments where the agent must remember something, and which part should answer each one.

Outputs

Four-tier memory architecture — Conversational, semantic, episodic, and procedural stores, each with a clear interface.
Per-tier policies — Write, lookup, and compression rules for each part, written down and easy to tune.

Steps (6)

Build the conversational tier
Keep a window on the live conversation. Pick how you trim it (sliding window, summary plus window, or layered summaries) and decide what gets promoted to the longer-lived parts.
usesShort-Term Thread Memory Episodic Summaries
Build the semantic tier
Store extracted facts and their embeddings so you can look them up by similarity. Decide what counts as a fact and what stays as an episode.
usesSemantic Memory Vector Memory
Build the episodic tier
Save real past interactions and events with timestamps and IDs. Episodes are the agent's diary. You use them to recall what happened last time.
usesEpisodic Memory
Build the procedural tier
Store skills, routines, and plans that worked, so the agent can replay them. This is how the agent gets faster and better at a recurring task.
usesProcedural Memory Skill Library
Define retrieval orchestration across tiers
Decide which part answers which kind of recall request. Decide how to merge or rank results when more than one part replies.
Define compression and forgetting policies
Give each part a way to shrink: summarise, cluster, decay by recency, or drop on purpose. Without these rules the parts grow without limit.
usesEpisodic Summaries Memory-Type Storage Specialization

Framework-specific instructions

Pick a framework and generate a framework-targeted rewrite of this methodology's steps.

Choose framework

AI-generated for Agent Development Kit (ADK) (Google) — verify against official docs.

Principles

Memory is not one store. It is four parts that work together with different jobs.
Every part has its own write, read, and compression rules.
Look-ups are routed across the parts, not handed to a single vector index.
Forgetting is a feature. Design it on purpose.

Known failure modes (3)

Related patterns (7)

Related compositions (1)

recipe · abstract shape
Memory Architecture
How long-running agents structure what they remember: tiered short-to-long-term cascade, compaction across the window, paging, and reasoning carry-forward across tool calls.

Related methodologies (1)

Agentic Workflow Construction★★
7 steps
Make agent authors name the four parts and the freedom level before they code, so a failure points to one part instead of smearing across a vague agent.

Sources (2)

Provenance

Added to catalog: 2026-05-24
Last updated: 2026-05-27
Verification status: verified

Methodology process overview

Steps (6)

Build the conversational tier

Build the semantic tier

Build the episodic tier

Build the procedural tier

Define retrieval orchestration across tiers

Define compression and forgetting policies