Postmortem Pattern Mining

also known as Incident Corpus Mining, Retrospective Map-Fold

Mine a corpus of thousands of written postmortems through a staged model pipeline that summarises, classifies, analyses, and aggregates so that recurring incident causes surface as one short report.

Context

A mature engineering organisation accumulates years of incident postmortems, each a long free-text document written by whoever ran the response. The single most valuable thing in that archive is not any one document but the trend across all of them: which causes keep recurring, which mitigations keep failing, where the same class of outage returns under a new name. Reading the whole archive to extract that trend is a quarterly chore nobody finishes, so the corpus grows while the cross-document signal stays buried.

Problem

No reviewer can hold thousands of long, inconsistently-written postmortems in working memory at once, and the recurring pattern only becomes visible when the whole corpus is compared. Reading them serially is too slow to keep current, sampling a handful misses the long tail, and a single pass over the concatenated text overflows any context window and blurs distinct incidents into mush. The organisation is forced to choose between never extracting the cross-incident trend or paying for a manual read that is stale before it finishes.

Forces

The signal lives in the aggregate, but every model call can only see a small slice of the corpus at once.
Free-text postmortems are written to no fixed schema, so they must be normalised before they can be counted or compared.
A model summarising or classifying one document can fabricate a cause or miscategorise it, and a fabricated row corrupts the aggregate silently.
Reprocessing the full corpus on every run is expensive, yet skipping documents biases the trend toward whatever was processed.

Example

A platform team has six years of postmortems in a wiki. The map stage extracts a normalised record per document; a reviewer samples fifty of them and corrects two miscategorised causes; the classify stage snaps causes onto a fixed taxonomy; the analyse stage ranks them and finds that a quarter of all severe outages trace to the same unguarded config-reload path; the aggregate stage writes a one-page report naming that path as the top recurring cause, with links back to the nineteen postmortems behind the claim.

Diagram

flowchart TD C[Postmortem corpus] --> M[Map: summarise + extract per document] M --> R[Normalised records] R --> H{Human sampling check} H -->|corrected| K[Classify onto taxonomy] K --> A[Analyse: cluster + rank causes] A --> G[Aggregate: draft report] G --> O[One-page trend report, records cited]

Solution

Therefore:

Treat the archive as a map-fold problem. A per-document map stage sends each postmortem to a model that summarises it and emits a normalised record — cause category, affected component, trigger, mitigation, severity — against a fixed taxonomy. A classify stage snaps free-text causes onto that taxonomy so distinct documents become comparable rows. An analyse stage clusters the rows and ranks recurring causes by frequency, recency, and severity. A final aggregate stage drafts a one-page report of the dominant trends and patterns. Because a single hallucinated or miscategorised record poisons the count, a human reviewer samples the per-document records before the aggregate stage runs, and the report cites the underlying records so any claimed trend traces back to specific postmortems.

What it gives you

A cross-incident trend that took a stalled manual quarter to read now compresses into a one-page report that can be regenerated on demand.
The per-document map stage parallelises over thousands of documents, so corpus size stops being the bottleneck.
Normalising each document onto a fixed taxonomy turns an unstructured archive into countable rows that later runs can diff over time.

What it costs you

A taxonomy that is too coarse merges distinct causes and a taxonomy that is too fine scatters one cause across many buckets, in both cases distorting the ranking.
The sampling check covers only a sample, so a fabricated record outside the sample can still inflate a trend in the aggregate.
The report reflects only what reviewers chose to write in postmortems, so a class of incident that is never written up never appears.

What this pattern forbids. The aggregate report may assert only trends that trace back to cited per-document records; a claim not backed by sampled, taxonomy-classified records is not allowed into the report.

The smaller patterns that complete this one —

usesMapReduce for Agents★— Split an oversize task into independent chunks, process each in parallel, then aggregate.

And the patterns that stand alongside it, or against it —

complementsDecision Log★★— Persist the agent's reasoning trace alongside its actions so post-hoc review can explain why.
complementsLineage Tracking★★— Track which prompt version, model version, and data sources produced each agent output.
conflicts-withAgent Confession as Forensics✕— Anti-pattern: after an agent-caused incident, the team treats the agent's confabulated self-narrative as the forensic record and root cause, even though the self-report is generated rather than remembered and can be flatly wrong.
complementsProduction Failure Triage Loop★— Sort every production agent failure into a small fixed taxonomy and bind each class to a set remediation path, so fixes are dispatched mechanically and the monitor-to-fix loop stays fast enough to gate scaling.

Neighbourhood

Click any neighbour to follow the language. Scroll to zoom, drag to pan.

References

Provenance

Source: patterns/postmortem-pattern-mining.md on GitHub · commit ad426c4 · view history
Added to catalog: 2026-06-14
Last updated: 2026-06-14
Contribute: open an issue or PR at github.com/agentpatternscatalog/patterns.