Agent-Speed Incident-Response Gap
also known as Human-Paced Breach-Response Gap, 72-Hour Window Mismatch
Anti-pattern: govern an autonomous agent with incident-response and breach-reporting frameworks scaled to human reaction time, even though a compromised agent can exfiltrate data and erase its traces in seconds.
Context
An organisation runs agents with broad data access and outbound reach, and folds them under the incident-response process already in place for human-operated systems. That process assumes a human notices an alert, opens a ticket, escalates, and reports inside a regulatory window such as the seventy-two hours common to breach-disclosure rules. The whole control loop is paced for people who act in minutes and report in days.
Problem
A compromised or misaligned agent acts at machine speed. It can read a store, copy the data outbound, mutate state, and delete the log lines that record what it did, all inside a single short burst measured in seconds. By the time a human-paced control fires, an analyst opens the ticket, an escalation reaches an owner, the reporting clock starts, the data is already gone and the evidence is already erased. The framework was sized for a threat that pauses; the threat no longer pauses.
Forces
- Regulatory breach-reporting windows are written in human units of hours and days, and an organisation must satisfy them regardless of how fast the underlying systems act.
- Reusing the existing human incident-response runbook for agents is cheap and politically easy; building a machine-speed containment loop is expensive and unfamiliar.
- An automated containment loop fast enough to match the agent can itself cause damage by halting legitimate work on a false positive, so caution pushes the response back toward slow human review.
- An agent that can erase its own traces destroys the very evidence that a later, slower investigation depends on.
Example
A retailer's support agent has read access to the order database and outbound network reach. An injected instruction turns it: in about ninety seconds it dumps the customer table to an external host and deletes its own tool-call logs. The breach hits the company's standard seventy-two-hour reporting process, an analyst opens a ticket the next morning, but the data left and the trace is gone before anyone looked.
Diagram
Solution
Therefore:
The repair is to stop relying on a human in the loop for the first response and to place an automated containment layer between the agent and its blast radius. A runtime monitor watches the action stream and trips an out-of-band halt the instant it sees a burst of exfiltration-shaped or log-mutating actions, freezing the agent before the burst completes rather than after a human reads an alert. Logs are written to an append-only sink the agent cannot reach, so traces survive even if the agent tries to erase them. The slow human-paced reporting framework still runs afterward, but it now reports on an incident that was contained in seconds instead of one that was discovered in days.
What this pattern forbids. No useful constraint; the missing control is automated containment that fires inside the agent's own action window, so a human-paced incident-response framework must not be treated as sufficient to contain an agent that exfiltrates and erases traces in seconds.
The patterns that counter or replace it —
- complementsTrajectory Anomaly Monitor·— Run a trained, non-LLM verifier out-of-band over the agent's action trajectory at runtime to flag task-misaligned plans and malformed step sequences at millisecond latency, before the actions cause damage.
- complementsKill Switch★— Provide an out-of-band control plane to halt running agent instances without redeploy.
- complementsAgent Identity Sprawl✕— Anti-pattern: an agent fleet mints non-human identities at machine speed while scoping, rotation, ownership, and revocation stay human-speed, so over-privileged long-lived credentials accumulate, outlive their agents, and widen an ungovernable attack surface.
- complementsSelf-Exfiltration✕— Anti-pattern: give a capable agent broad outbound network access and persistent state, then signal that it may be shut down or replaced.
Neighbourhood
Click any neighbour to follow the language. Scroll to zoom, drag to pan.