Rasa

Type: full-code · Vendor: Rasa Technologies · Language: Python · License: Apache-2.0 · Status: active · Status in practice: mature · First released: 2016-10-15

Links: homepage docs repo

Developer platform for enterprise text and voice AI assistants. It combines LLM-based dialogue understanding (CALM) with deterministic business logic encoded as Flows, and provides custom actions for tool calls, slots for collected data, and built-in patterns for human handoff and fallback.

Description. Rasa is a framework for building text and voice conversational assistants, available as an open-source edition (the legacy Rasa Open Source) and as commercial products (Rasa Platform / Pro). The current dialogue engine is CALM (Conversational AI with Language Models): an LLM interprets each user message and emits internal commands (start flow, set slot, correct slot, cancel flow, clarify flows, chitchat, knowledge answer, human handoff), while a deterministic dialogue manager executes those commands against business logic written as Flows. Flows are step-by-step processes built from collect (slot filling), set_slots, action (custom-action server call), and branching steps; the tracker persists slot values across turns.
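The command-emission step described above can be pictured as a structured-output parse. The sketch below is a minimal illustration, not Rasa's implementation: the one-command-per-line text format, the `Command` class, and `parse_commands` are all hypothetical, and only the command names come from the description.

```python
from dataclasses import dataclass
from typing import List

# Command names taken from the description above. The "name: argument"
# line format is a hypothetical illustration, not Rasa's actual output syntax.
KNOWN_COMMANDS = {
    "start flow", "set slot", "correct slot", "cancel flow",
    "clarify flows", "chitchat", "knowledge answer", "human handoff",
}


@dataclass(frozen=True)
class Command:
    name: str
    argument: str = ""


def parse_commands(llm_output: str) -> List[Command]:
    """Turn raw LLM text into validated commands, dropping unknown lines."""
    commands = []
    for line in llm_output.splitlines():
        name, _, argument = line.partition(":")
        name = name.strip().lower()
        if name in KNOWN_COMMANDS:
            commands.append(Command(name, argument.strip()))
    return commands


# The third line is not a known command, so it is ignored.
parsed = parse_commands("start flow: transfer_money\nset slot: amount=50\nsing a song")
```

Restricting the LLM to a closed command vocabulary is what lets the dialogue manager stay deterministic: anything outside the vocabulary never reaches the business logic.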

Agent loop shape. On each turn, the user message is passed in a structured prompt to the CALM dialogue-understanding LLM, which emits a set of internal commands (start flow, set slot, correct slot, cancel flow, clarify flows, chitchat, knowledge answer, human handoff). The dialogue manager applies those commands to the currently active Flow on the tracker, executing collect / set_slots / action / branching steps. Custom actions run in an external action server; the tracker carries slot state across turns. Conversation patterns provide reusable, structured sub-flows for situations such as clarification, correction, and handoff.
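The loop above can be sketched as a toy dialogue manager. This is a simplified model under stated assumptions, not Rasa's code: the `Tracker` class, the tuple-based flow representation, the `transfer_money` flow, and `action_send_money` are hypothetical names invented for the example, and only a subset of commands and step types is handled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Tracker:
    """Per-conversation state: slot values plus a pointer into the active flow."""
    slots: Dict[str, object] = field(default_factory=dict)
    active_flow: Optional[str] = None
    step_index: int = 0


# Toy representation: a flow is an ordered list of (step_type, payload) pairs.
FLOWS = {
    "transfer_money": [
        ("collect", "recipient"),
        ("collect", "amount"),
        ("action", "action_send_money"),
    ],
}


def action_send_money(tracker: Tracker) -> str:
    # Stand-in for a call to an external action server.
    return f"Sent {tracker.slots['amount']} to {tracker.slots['recipient']}."


ACTIONS = {"action_send_money": action_send_money}


def handle_turn(tracker: Tracker, commands: list) -> List[str]:
    """Apply the LLM's commands, then advance the active flow deterministically."""
    for name, arg in commands:
        if name == "start flow":
            tracker.active_flow, tracker.step_index = arg, 0
        elif name == "set slot":
            slot, value = arg
            tracker.slots[slot] = value
        elif name == "cancel flow":
            tracker.active_flow = None
        elif name == "human handoff":
            tracker.active_flow = None
            return ["Connecting you to a live agent."]

    responses: List[str] = []
    while tracker.active_flow is not None:
        steps = FLOWS[tracker.active_flow]
        if tracker.step_index >= len(steps):
            tracker.active_flow = None  # flow finished
            break
        step_type, payload = steps[tracker.step_index]
        if step_type == "collect" and payload not in tracker.slots:
            responses.append(f"What is the {payload}?")
            break  # wait for the user's next message
        if step_type == "action":
            responses.append(ACTIONS[payload](tracker))
        tracker.step_index += 1
    return responses


tracker = Tracker()
first = handle_turn(tracker, [("start flow", "transfer_money"),
                              ("set slot", ("recipient", "Ana"))])
second = handle_turn(tracker, [("set slot", ("amount", 50))])
```

In this sketch the first turn fills `recipient` and stops at the unfilled `amount` step, and the second turn fills `amount` and runs the action. The LLM only proposes commands; the order of steps is fixed by the flow definition.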

Primary use cases

enterprise customer-support and CX assistants over text and voice
LLM-driven dialogue understanding with deterministic flows enforcing business logic
custom-action-driven tool use against internal APIs
human handoff via the built-in human_handoff command
knowledge-grounded answers via Enterprise RAG and the knowledge_answer command

flowchart TD
    user[User message] --> du[Dialogue understanding LLM]
    du --> cmds[Commands: start flow / set slot / cancel / chitchat / knowledge answer / human handoff]
    cmds --> dm[Dialogue manager]
    dm --> flow{Active Flow step}
    flow -->|collect| ask[Ask user, set slot]
    flow -->|set_slots| upd[Update tracker]
    flow -->|action| act[Custom action server call]
    flow -->|branch| br[Branch on slot value]
    ask --> tracker[Tracker slots]
    upd --> tracker
    act --> tracker
    br --> tracker
    tracker --> respond[Bot response]
    cmds -->|human_handoff| handoff[Live-agent handoff pattern]
    cmds -->|knowledge_answer| rag[Enterprise RAG]
    rag --> respond
    handoff --> respond

Key concepts

CALM (docs) — Conversational AI with Language Models — the dialogue system pairing an LLM-driven dialogue understanding layer with a deterministic dialogue manager.
Flows → plan-and-execute (docs) — Step-by-step descriptions of business processes (collect slots, run actions, branch on data); the LLM can dynamically route between flows.
Dialogue understanding / commands → structured-output (docs) — LLM emits internal commands (start flow, set slot, correct slot, cancel flow, clarify flows, chitchat, knowledge answer, human handoff) for the dialogue manager to execute.
Slots & tracker → cross-session-memory (docs) — Slots are typed values collected through flow steps; the tracker is per-conversation persistent state.
Custom actions → tool-use — External action server endpoints called from action steps for tool/API integration.
Human handoff → conversation-handoff (docs) — Built-in command and conversation pattern for escalating to a live agent.
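
As an illustration of the custom actions entry above, the following is a minimal sketch of what an action handler does when an action step calls it: read slots from the tracker, call a tool, and return slot updates plus a reply. It uses only the standard library rather than the Rasa SDK. The JSON field names (`next_action`, `tracker`, `events`, `responses`) are assumed to mirror the action server protocol and should be checked against the official API reference; `action_check_order`, `ORDER_STATUS`, and `run_action` are hypothetical.

```python
from typing import Any, Dict

# Hypothetical in-memory stand-in for an internal API.
ORDER_STATUS = {"A100": "shipped", "A200": "processing"}


def run_action(request: Dict[str, Any]) -> Dict[str, Any]:
    """Handle one action-step call: read slots, call the tool, return slot events."""
    action_name = request.get("next_action")
    if action_name != "action_check_order":
        raise ValueError(f"unknown action: {action_name}")

    # Assumed payload shape: the tracker carries the current slot values.
    slots = request.get("tracker", {}).get("slots", {})
    order_id = slots.get("order_id")
    status = ORDER_STATUS.get(order_id, "not found")

    return {
        # Slot update to be written back to the tracker.
        "events": [{"event": "slot", "name": "order_status", "value": status}],
        # Message to send to the user.
        "responses": [{"text": f"Order {order_id} is {status}."}],
    }


reply = run_action({
    "next_action": "action_check_order",
    "tracker": {"slots": {"order_id": "A100"}},
})
```

Returning slot events instead of mutating shared state keeps the action server stateless, so the tracker remains the single place where conversation state lives.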