Risk-Tiered Action Autonomy

also known as Maker-Checker Agent Boundary, Segregation-of-Duties Action Gate, Four-Eyes Action Release

Set an agent's permitted action class by the financial materiality of the action, letting it read and draft freely while requiring a different human principal to release material postings, payments, or filings.

Context

An agent operates inside a financial system of record — an ERP, accounting suite, treasury, or payments platform — where actions range from harmless reads to irreversible movements of money. Regulators and internal controls already require that the person who initiates a financial transaction is not the person who approves it, and that material postings leave an audit trail. The agent is fast and tireless at the low-risk end of this range and genuinely dangerous at the high-risk end, and the same model produces both kinds of action.

Problem

Treating autonomy as a single switch forces a bad choice: granted wholesale, the agent can post journals, release payments, or file returns on its own, collapsing the segregation of duties that controls depend on; withheld wholesale, every trivial reconciliation waits on a human and the agent is not worth running. A single global approval step does not help either, because it makes a human rubber-stamp thousands of low-risk drafts while giving a high-value payment the same shallow glance. Worse, if the human approver is the same identity that launched the agent, the four-eyes control exists only on paper.

Forces

Low-risk, high-volume actions are where the agent pays for itself, and gating them all on a human destroys that value.
High-risk, low-volume actions are where an error is expensive or irreversible, and skipping a human is unacceptable.
Segregation of duties requires that the approver be a different principal from the initiator, which a single-operator agent setup quietly violates.
Materiality is the dimension regulators and auditors reason about, but it is not the same as token cost or model confidence.
Approval fatigue makes a uniform gate worse than useless: reviewers stop reading when most of what they see is trivial.

Example

A finance team runs an agent in its ERP to clear the accounts-payable backlog. The agent reads ledgers and reconciles freely; it codes and stages invoices as drafts with full logging; but any payment above a set amount, any new payee, or any cross-border transfer is held as a draft on an approval queue. A controller — signed in as a different user than the analyst who started the agent — reviews only those held items and releases them. Every release records who initiated, who approved, the tier the action landed in, and the rule that put it there, so an auditor can later confirm that no material payment moved without two distinct people.

Diagram

flowchart TD P[Proposed action] --> C{Tier classifier<br/>amount · account · counterparty} C -->|read / analyse| A1[Execute autonomously + log] C -->|draft / stage| A2[Write draft + log] C -->|material| H[Hold as draft on release queue] H --> R{Approver != initiator?} R -->|yes| X[Release and execute] R -->|no| H A1 --> T[(Immutable trail:<br/>initiator, approver, tier, rule)] A2 --> T X --> T

Solution

Therefore:

Define a small set of risk tiers over the action surface, keyed on materiality rather than on cost or model confidence: for example read and analyse freely, draft and stage with full logging, and hold-for-release for anything that moves money or alters the books above a threshold. Classify every proposed action into a tier before it executes, using deterministic rules — amount thresholds, account sensitivity, counterparty, jurisdiction — rather than the model's own judgement of its risk. Actions in the autonomous tiers run and are logged; actions in the release tier are written as drafts and placed on a queue that only a human can clear. Bind that release step to a different identity than the one that initiated the agent run, so the maker (agent plus its operator) and the checker (the approver) are structurally separate. Record initiator, approver, tier, and the rule that set the tier in an immutable trail so the control is auditable after the fact.

What it gives you

Autonomy is high where it is safe and bounded where it is costly, instead of one global setting that is wrong at both ends.
Segregation of duties holds by construction because the release identity is structurally distinct from the initiator.
Reviewers spend attention on the few material actions instead of rubber-stamping trivial drafts.
The materiality rule that set each tier is recorded, so an auditor can replay why an action ran on its own or waited for a human.

What it costs you

Defining materiality tiers and thresholds is domain work the finance or controls function must own, not engineering.
A mis-tuned threshold either leaks a material action into an autonomous tier or floods the release queue and revives approval fatigue.
Enforcing approver-distinct-from-initiator requires real identity and role data a thin single-operator deployment may not have.
The release queue adds latency to exactly the actions that are most time-sensitive, such as a payment near a cut-off.

What this pattern forbids. The agent must not execute any action classified into the release tier without an explicit clearance from a human principal whose identity differs from the initiator of the agent run; it must not select or downgrade its own risk tier; and a material action whose tier cannot be determined must be held rather than executed.

And the patterns that stand alongside it, or against it —

alternative-toCost-Aware Action Delegation★— Classify every agent action by risk/cost and route each tier to a different approval policy, bounding the autonomy surface per-action instead of by one global flag.
complementsAutonomy Slider★— Expose agent autonomy as a continuous adjustable parameter so the same codebase can span scripted assistant to fully autonomous worker without re-architecting.
complementsProgressive Delegation★— Stage the human-to-agent handoff over time: the agent starts producing drafts a human always reviews; its autonomy expands action-by-action as measured trust accrues.
complementsApproval Queue★★— Queue agent-proposed actions for asynchronous human review while the agent continues other work.
complementsHuman-in-the-Loop★★— Require explicit human approval at defined points before the agent performs an action.
complementsSession-Scoped Payment Authorization·— Bound an agent's autonomous spending by having it open a payment session with a pre-approved cap, stream many micropayments inside that session, and settle once on close, instead of seeking approval for every transaction.
complementsPolicy-as-Code Gate★— Evaluate every proposed agent action against externally-managed machine-readable policies before dispatch, so compliance authorship lives outside the prompt and outside the agent code.
complementsCompensating Action★★— Pair every irreversible-looking agent action with a compensating action that can undo or counteract it.
complementsTenant-Scoped Tool Binding★— Bind every tool call and retrieval to the active tenant in code at the execution layer, so a multi-tenant agent can never be talked into reading or writing another tenant's data.
complementsCanonical-Entity Grounding★— Require the agent to resolve every business identifier it uses — SKU, account, supplier, customer — through an authoritative lookup against the system of record, rather than emitting the identifier from the model's parametric memory.
complementsMandatory Red-Flag Escalation★★— Maintain a deterministic set of high-risk triggers so that on any match the agent immediately aborts its workflow and hands off to a human, without weighing whether to escalate.
complementsSemantic-Layer Query Guardrail★— Route natural-language data questions through a curated semantic layer so the model selects and parameterises vetted metrics and dimensions instead of free-authoring raw SQL against production data.
complementsReversibility-Aware Action Filter·— Insert a standing filter between the policy and the environment that estimates each proposed action's reversibility and re-samples the policy until a reversible action is chosen.
complementsDeterminism-Tiered Replay Gate·— Classify an agent into a reproducibility tier by re-running identical inputs, require the strictest decision-determinism tier for regulated decisions, and gate deployment and validation-sample size on the measured tier.
complementsChange-Freeze-Aware Action Gate★— Check every mutating agent action against an active deploy-freeze or maintenance calendar and block it or force explicit human re-authorisation while a freeze covering its scope is in effect.
complementsDeployment-Correlated Rollback Gate★— Gate an incident-response agent's authority to execute a rollback on whether the failure is temporally correlated with a recent deployment, unlocking autonomous rollback only on a clear deploy-to-failure link and escalating otherwise.
complementsRefund Threshold Drift✕— Anti-pattern: a correctly-set autonomous-refund cap drifts upward over time as the agent accommodates edge cases, so the effective approval ceiling and financial exposure grow silently with no hard limit to catch it.
complementsAgent Liability Insurance★— Transfer the residual risk of autonomous agent failure to an insurer through agent-specific coverage, with an auditable certification standard gating insurability, so unbounded liability becomes a bounded, priced cost.