ElevenLabs Conversational AI
also known as Eleven Agents
Type: full-code · Vendor: ElevenLabs · Language: API · License: proprietary · Status: active · Status in practice: mature
Hosted real-time voice agent stack from ElevenLabs that wires an ASR model, a configurable LLM, a low-latency TTS voice and a proprietary turn-taking model into a single managed conversational loop.
Description. ElevenLabs Conversational AI (rebranded Eleven Agents) is a hosted platform for building voice agents that listen, understand and act in real time across voice and chat. The pipeline pairs a fine-tuned ASR, the developer's choice of language model and a low-latency ElevenLabs TTS voice with a proprietary turn-taking model, and is extended via System Tools (built-in actions such as agent transfer, end call, language detection), Client Tools (browser/mobile actions) and Server Tools (HTTP calls to the developer's backend).
Agent loop shape. Hosted streaming voice loop: incoming audio is transcribed by a fine-tuned ASR, the user's selected LLM produces a response, ElevenLabs TTS streams audio back, and a proprietary turn-taking model drives endpointing and barge-in. Tool calls fan out via System / Client / Server tools; agent transfer is a system tool that hands the live conversation to another configured agent.
Primary use cases
- customer support voice agents across telephony and web
- multilingual voice assistants spanning ASR and TTS languages
- multi-agent workflows with hand-off to specialised ElevenLabs agents
- integration of CRM, calendar and payment backends via server tools
Key concepts
- Agent platform (docs) — ASR + LLM + TTS + turn-taking model packaged as one configurable agent.
- System tools → tool-use (docs) — Built-in actions that mutate conversation state without external calls (agent transfer, end call, language detection).
- Client tools → tool-use (docs) — Tools executed in the embedding browser or mobile app.
- Server tools → tool-use (docs) — HTTP-call tools that reach external systems and APIs.
- Agent transfer → conversation-handoff (docs) — Hand off an ongoing conversation to another designated ElevenLabs agent.
- Multilingual voices → multilingual-voice-agent (docs) — Voice library spanning 5k+ voices and 70+ languages used by the agent's TTS leg.
Patterns this full-code implements —
- ★★Conversation Handoff to Human
Agent-to-agent transfer is a documented system tool that moves the ongoing conversation to another designated ElevenLabs agent.
- ★Multilingual Voice Agent Stack
TTS leg ships 5k+ voices across 70+ languages; conversation page advertises real-time language detection and switching.
- ★★Tool Use
Three explicit tool categories: System tools (in-conversation state), Client tools (browser/mobile), Server tools (HTTP).
- ★★Stop / Cancel
Proprietary turn-taking model handles interruptions and turn boundaries; conversation-flow page exposes the knobs.
- ★★Streaming Typed Events
WebSocket protocol delivers typed real-time events between server and client — base64 audio chunks, transcripts, agent responses, interruption events, and character-level alignment data.
Neighbourhood
Click any neighbour to follow the lineage. Scroll to zoom, drag to pan.