Vapi

Type: low-code · Vendor: Vapi · Language: Web product / API · License: proprietary · Status: active · Status in practice: mature

Links: homepage docs

Hosted voice AI platform that orchestrates a transcriber, model and voice provider into a phone-callable assistant, with squads for multi-assistant handoff, function-calling tools and multilingual voice agents.

Description. Vapi is positioned as an orchestration layer over three pluggable modules — transcriber (STT), model (LLM) and voice (TTS) — plus a suite of real-time orchestration models for endpointing, interruption handling, backchanneling and emotion detection. Developers configure an Assistant, attach Tools (function calling, custom and code tools, transfer call), and optionally compose Squads where specialised assistants hand off to one another mid-call. Multilingual mode adds automatic language detection.

Agent loop shape. Hosted real-time pipeline. Caller audio flows into Vapi's transcriber; the LLM produces a response interleaved with tool calls; a chosen TTS speaks the result back; a suite of real-time models layered on top of STT/LLM/TTS handles endpointing, interruption, backchanneling and emotion. Squads compose multiple specialised assistants and route the live call between them via assistantDestinations.

Primary use cases

outbound and inbound phone agents
customer support and lead qualification at scale
appointment scheduling and reception
multi-assistant workflows via squads

flowchart TD caller[Caller] --> vapi[Vapi orchestration] vapi --> stt[Transcriber STT] stt --> llm[Model LLM] llm --> tool{Tool call?} tool -->|function| fn[Function / custom / code tool] tool -->|transfer| xfer[Transfer call] tool -->|none| tts[Voice TTS] fn --> llm tts --> caller rt[Real-time models: endpointing/interruption/backchannel/emotion] -.augment.-> stt rt -.augment.-> tts squad[Squad: assistantDestinations] -.routes.-> llm

Key concepts

Assistant (docs) — Configurable agent with transcriber, model, voice and tools.
Orchestration layer (docs) — Vapi sits over transcriber + model + voice and adds real-time models.
Squads → conversation-handoff (docs) — Multiple specialised assistants that hand off mid-call.
Tools → tool-use (docs) — Function calling, custom tools, code tools and call transfer.
Multilingual → multilingual-voice-agent (docs) — Multilingual assistants with automatic language detection.
Voice pipeline configuration (docs) — Endpointing, interruption handling, backchanneling, filler injection.

Vapi

Neighbourhood

Alternatives & relatives

Listed as alternative by (5)

References

Provenance