Refusal

also known as Decline, Out-of-Scope Response

Explicitly refuse requests that fall outside the agent's scope, capability, or policy boundaries.

Context

A team runs an agent with a defined scope — customer support for a specific product, technical help in a specific domain, internal operations for a specific team — and real users will ask it things outside that scope: medical advice from a banking agent, legal interpretation from a coding assistant, competitor comparisons from a vendor's own bot. Some of these requests are simply off-topic; others are unsafe, regulated, or beyond what the model can reliably do.

Problem

A helpful-by-default agent answers these out-of-scope questions anyway, producing plausible-sounding but unauthorised content: a stock pick from a system that has no business giving one, a dosage suggestion from a tool that is not a medical device, a confident wrong answer in a domain the model has not been validated against. Silently routing such requests through the model also strips the user of the signal that the agent has a boundary. Without an explicit, kind refusal at the named boundary, the agent drifts into territory that erodes trust and exposes the operator.

Forces

Over-refusal frustrates users.
Under-refusal lands the agent in trouble.
Refusal text quality matters; templated refusals feel insulting.

Example

A customer-service agent for a bank starts being asked for stock picks, legal advice, and competitor comparisons. Helpful-by-default, it answers and gets the bank into hot water. The team defines refusal triggers (regulatory boundary, out-of-scope, capability gap) and a kind, specific refusal template that names the boundary and points to a human team. Out-of-scope replies stop being plausible-sounding hallucinations and start being short, clear handoffs.

Diagram

flowchart TD Req[Request] --> T{Refusal trigger?} T -->|policy violation| Ref[Clear, kind refusal] T -->|out of scope| Ref T -->|capability gap| Ref T -->|regulatory| Ref T -->|none| Run[Handle normally] Ref --> Alt[Suggest alternative] Ref --> Log[(Refusal log)]

Solution

Therefore:

Define refusal triggers (policy violation, out-of-scope, capability gap, regulatory boundary). Return a clear, kind, specific refusal that names the boundary and (when possible) suggests an alternative. Log refusals for review.

What this pattern forbids. When triggers fire, the agent must refuse rather than attempt the task.

The smaller patterns that complete this one —

usesConstitutional Charter★— Define rules the agent reads every turn but cannot modify, encoding inviolable boundaries.
generalisesScope-of-Practice Boundary Gate★— Block requests and responses that perform license-gated professional activities unless a licensed human is in the loop, enforcing the boundary in code outside the reasoning loop.

And the patterns that stand alongside it, or against it —

complementsInput/Output Guardrails★★— Validate inputs before they reach the model and outputs before they reach the user.
conflicts-withCode-Switching-Aware Agent★— Treat mixed-language input (e.g. Hinglish in Roman script) as the expected shape, and design tokenisation, language tagging, and tool routing to handle it natively without forcing the user to commit to one language.
complementsPolicy-as-Code Gate★— Evaluate every proposed agent action against externally-managed machine-readable policies before dispatch, so compliance authorship lives outside the prompt and outside the agent code.
complementsTyped Refusal Codes★— Define a single source of truth for machine-readable refusal codes across all guard surfaces, so refusals can be triaged mechanically rather than by string-grepping ad-hoc human-readable messages.
complementsReflexive Metacognitive Agent·— Agent maintains an explicit self-model of its own capabilities, confidence and limitations, and reasons over that model when accepting / refusing / handing off tasks.
alternative-toOver-Helpfulness✕— Anti-pattern: the agent prioritises responsiveness and task completion over correctness, producing confident output for a request beyond its capability or scope instead of abstaining, clarifying, or handing off.
alternative-toEnforced Advisory Disclaimer★— Append a non-suppressible advisory framing every high-risk regulated answer as information rather than professional advice, attached outside the model's discretion so it survives pushback and model updates.

Neighbourhood

Click any neighbour to follow the language. Scroll to zoom, drag to pan.

Used in recipes

Safety Hardening
hardening

Used in frameworks

References

Constitutional AI: Harmlessness from AI Feedback
paper

Provenance

Source: patterns/refusal.md on GitHub · commit 4fa1213 · view history
Added to catalog: 2026-04-30
Last updated: 2026-05-21
Contribute: open an issue or PR at github.com/agentpatternscatalog/patterns.