Cross-Reflection
Reflection step performed by a *different* agent or foundation model from the original generator, so critique error is decorrelated from generation error.
Problem
Self-critique by the same model misses correlated failure modes: the generator's hallucinations get reproduced in its own review of those hallucinations. After one or two iterations, the loop self-approves. The fix requires a critic with different blind spots — a different model architecture, different training data, or both.
Solution
Generator (Model A) produces draft. Critic (Model B, distinct architecture) reviews draft against named criteria. If Model B accepts, ship. If Model B rejects, either revise (back to Model A with critique) or escalate. Pair with frozen-rubric-reflection so the critic uses fixed criteria, not free-form. Distinct from same-model-self-critique and llm-as-judge (which is judge-only without iteration).
When to use
- Output quality matters more than per-call cost.
- Two distinct capable models are available.
- Critique criteria can be expressed as a fixed rubric the critic checks.
Open the full interactive page →
Diagram, neighbourhood map, code examples, related patterns and full provenance.