March 31, 2026

What’s New: October 2025 – March 2026

Here’s a summary of major items shipped from October 2025 through March 2026.

Squads v2: Visual builder to simplify sophisticated multi-assistant orchestration with seamless handoffs between specialized agents.
Composer (Alpha): Intelligent assistant inside the dashboard that allows you to describe what you need through plain text prompts to help build, adjust, and debug voice agents.
Simulations (Alpha): Voice agent testing feature to build confidence through enabling systematic, AI-powered testing in specific scenarios with evaluation of outcomes.
Monitoring & Issues: Automated call quality monitoring with trigger-based issue detection, alerting, and resolution suggestions.
HIPAA with Data Retention: New compliance mode with private storage and in-dashboard toggle/purchase flow — available for additional cost.
Zero Data Retention: Compliance mode that keeps context data during call as needed to execute tasks and retains no data afterwards.
Consolidated Logs: Unified log viewing into a single page.
Vapi Voices: 12 new ultra-realistic voices released, optimized for latency and cost with adjustable speed controls exposed. 8 legacy voices deprecated.

Deepgram Nova-3 Languages: Added Hebrew, Urdu, Tagalog, and Arabic bilingual support.
Cartesia Transcriber: ink-whisper.
Soniox: stt-rt-v4.

GPT-5 Family: OpenAI’s latest intelligence models, including GPT-5, 5-Mini, 5-Nano, 5.1, 5.2, 5.4, 5.4-Mini, 5.4-Nano.
Claude 4.5–4.6: Anthropic’s latest intelligence models Sonnet 4.5, Opus 4.5, Opus 4.6, Sonnet 4.6.
Gemini 3 Flash: Google’s latest intelligence models.
Grok 4 Fast: Reasoning and non-reasoning variants.
GPT Realtime Mini: OpenAI’s lightweight realtime model.

Cartesia: sonic-3, sonic-3-2026-01-12, sonic-3-2025-10-27.
WellSaid: Caruso (new), legacy.
Inworld: inworld-tts-1 (REST, original), inworld-tts-1.5-max (WebSocket, $10/M chars), inworld-tts-1.5-mini (WebSocket, $5/M chars).
ElevenLabs Scribe v2: Latest version of ElevenLabs speech-to-text.

Structured Outputs Improvements: Updates to our AI-powered analysis and data extraction tool, including transient structured outputs, audio-based extraction, and regex extraction.
SIP Request Tool + DTMF over SIP INFO: Send SIP requests and DTMF tones via SIP INFO messages during calls.
Variable Passing Between Tool Calls: Pass output variables from one tool call as input to subsequent tool calls.
Encrypted Tool Arguments: Encrypt sensitive tool arguments to protect data in transit.
Low Confidence Speech Hook: Hook that triggers when the transcriber returns low-confidence speech results.
Time Elapsed Hook: Hook that triggers at specified time intervals during a call.
assistant.speechStarted Event: New event fired when the assistant begins speaking.
MCP Improvements: Bearer auth, $ref dereferencing, child tool messages/discovery.
Warm Transfer Improvements: SIP support, caller ID, context engineering, variable filling.