Framework · Orchestration Frameworks

NVIDIA NeMo Guardrails

NeMo Guardrails adds programmable rails around an LLM-based conversational application so that inputs and outputs can be checked, rewritten, or rejected outside the model's own discretion.

Description

NeMo Guardrails is an open-source Python toolkit from NVIDIA for adding rule-based controls to LLM applications. Developers express rails in Colang, a flow language, and the runtime applies them at fixed stages: input rails on the incoming user message, dialog rails that steer the conversation, and output rails on the generated response. Each rail can reject, alter, or rephrase the text it inspects, so safety and topic checks run as deterministic code around the model rather than as model behavior.

Solution

A wrap-around guard pipeline: user input passes through input rails, then dialog rails select a flow and the LLM generates a response, then output rails inspect and may reject or rewrite that response before it returns to the user. Rails are deterministic Colang subflows that bracket each model call rather than a free agent loop.

Primary use cases

  • topic and scope control for domain-specific assistants
  • input sanitization and PII masking on user messages
  • output filtering and response rewriting
  • refusal of disallowed subjects
  • enforcing predefined conversational flows

Open the full interactive page

Diagram, neighbourhood map, code examples, related patterns and full provenance.