Verification & Reflection

Cross-Reflection

Reflection step performed by a *different* agent or foundation model from the original generator, so critique error is decorrelated from generation error.

Problem

Self-critique by the same model misses correlated failure modes: the generator's hallucinations get reproduced in its own review of those hallucinations. After one or two iterations, the loop self-approves. The fix requires a critic with different blind spots — a different model architecture, different training data, or both.

Solution

Generator (Model A) produces draft. Critic (Model B, distinct architecture) reviews draft against named criteria. If Model B accepts, ship. If Model B rejects, either revise (back to Model A with critique) or escalate. Pair with frozen-rubric-reflection so the critic uses fixed criteria, not free-form. Distinct from same-model-self-critique and llm-as-judge (which is judge-only without iteration).

When to use

Output quality matters more than per-call cost.
Two distinct capable models are available.
Critique criteria can be expressed as a fixed rubric the critic checks.

Open the full interactive page →

Diagram, neighbourhood map, code examples, related patterns and full provenance.

Problem

Solution

When to use

Related