VII · Verification & ReflectionEmerging

Human Reflection

also known as Human-Critique-In-Reflection-Loop, Human-Feedback Refinement

Reflection loop that explicitly collects human feedback (not approval) on agent plans to improve them, distinct from approval gates where the human only says yes/no.

This pattern helps complete certain larger patterns —

  • specialisesHuman-in-the-Loop★★Require explicit human approval at defined points before the agent performs an action.
  • specialisesReflection★★Have the model review its own output and produce a revised version in one or more passes.

Context

A team has an agent that produces plans, drafts, or analyses. Human-in-the-loop is in place but limited to approving or rejecting the final output. Humans see the output but cannot easily inject critique that the agent must act on.

Problem

Yes/no approval underuses the human's expertise. A reviewer often knows *why* something is wrong and could improve it with a suggestion, but the approval workflow has no channel for that suggestion to become an agent revision. The agent ships approved-but-imperfect outputs; the reviewer takes the burden of editing manually.

Forces

  • Pure approval workflows are simpler and faster than feedback loops.
  • Human feedback adds latency to the production cycle.
  • Feedback quality varies — agents must handle low-signal feedback gracefully.

Example

A research-summary agent produces a draft for a senior analyst. Instead of {approve, reject}, the UI shows the draft with a critique widget. Analyst writes 'the methodology section omits the control group'. The agent ingests the critique and emits a revised draft. Three rounds. The third draft is approved. The captured critiques feed a quarterly rubric update.

Diagram

Solution

Therefore:

Render agent output to the human with a structured feedback widget (critique text + optional structured fields like 'wrong section', 'missing claim'). On submit, the agent ingests the feedback as a critique and produces a revision. Loop until human approves OR loop budget exhausts. Differs from approval-queue (yes/no) and from human-in-the-loop (which subsumes both). Pair with reflection, frozen-rubric-reflection, approval-queue.

What this pattern forbids. The agent must treat human feedback as a critique input subject to revision, not as a binary signal; a loop budget caps the number of revision rounds.

And the patterns that stand alongside it, or against it —

  • alternative-toApproval Queue★★Queue agent-proposed actions for asynchronous human review while the agent continues other work.
  • complementsFrozen Rubric ReflectionConstrain reflection to a fixed, hand-authored rubric of criteria so the reviewer cannot invent new ones each run.
  • complementsEvaluator-Optimizer★★One LLM generates; another evaluates and feeds back; loop until criteria are met.
  • complementsConfidence-Checking WorkflowAlways ask the agent, for each part of its output, to state its confidence and identify which parts need human verification, like triaging a junior analyst's work.
  • complementsCooperative Preference Inference·Agent and human jointly optimise the human's reward without the agent being told what it is — the interaction is a two-player game in which alignment is learned while acting.

Neighbourhood

Click any neighbour to follow the language. Scroll to zoom, drag to pan.

References

Provenance