What's New?

Subscribe to the latest product updates

Jul 20, 2026

What's New: Week of July 20, 2026

Model Intelligence: You can now set your assistant’s transcriber, llm, and voice models in one click with a Model Preset (choose between Balanced, High Intelligence, Ultra Fast, or Cost Saver), and see the latency, cost, and quality metrics for your chosen models so you can compare options and optimize with data.
Recording Download URLs in the End of Call Report: The end of call report now includes short-lived presigned download URLs for your call recordings and logs, so you can download them directly.
End of Call Reports with Zero Data Retention: End of call reports are now reliably delivered for transient assistants running under Zero Data Retention.

Jul 13, 2026

What's New: Week of July 13, 2026

Playback Speed for Recordings: Call recording players now include a playback-speed control.
Download Every Recording Type: You can now download every recording a call produced: mono, stereo, separate assistant and customer tracks, video, and packet capture.
VAD Transitions in Call Logs: The call log now surfaces voice-activity-detection transitions with a per-phase latency breakdown.
HIPAA Compliance: xAI is now HIPAA-compliant across its model, voice, and transcriber.

Jun 29, 2026

What's New: Week of June 29, 2026

OpenAI Realtime v2: OpenAI’s latest Realtime v2 model is now available for assistants.
AI-Generated Tool Failure and Completion Messages: You can now set role: 'system' on request-failed tool messages, and the dashboard has a new UI for configuring AI-generated messages when tools fail or complete. This gives assistants more natural responses when tools hit errors.
Call Logs Improvements:
- Significant latency improvements across both the dashboard and the /calls API endpoints.
- The call detail flyout now shows which assistant or squad handled the call, includes a link to the assistant or squad, and indicates which phone number was used.
- In squad or handoff calls, transcript messages now show which assistant said what, making it easier to trace conversation flow.
MCP Child Tools in Dashboard: When you connect an MCP server, the dashboard tool form now lists all child tools it discovers, so you can see exactly what capabilities your MCP server exposes.

Jun 22, 2026

What's New: Week of June 22, 2026

Gemini 3.5 Flash and 3.1 Flash-Lite: Google’s Gemini 3.5 Flash and 3.1 Flash-Lite models are now available for assistants.
Soniox stt-rt-v5: A new stt-rt-v5 real-time speech-to-text model is available for the Soniox transcriber.
Discord Login Removed: Discord is no longer offered as a login or signup option.
Variable Values in Handoff Webhooks: After an assistant handoff, webhook payloads now include the assistant’s configured variableValues.

Jun 15, 2026

What's New: Week of June 15, 2026

Claude Haiku (Global): Now available as a model through Amazon Bedrock for assistants.
Pronunciation Dictionary Management in Voice Config: Pronunciation dictionaries configured via the API can now be viewed and managed directly in an assistant’s voice settings in the dashboard.
- Stale dictionary references are flagged when a voice changes to a model that cannot apply them.
Rotating Tool Messages: You can now configure multiple message variants for a tool, and the assistant picks one at random so longer calls feel less repetitive.
Dynamic Variables in Test Calls: A dialog lets you set dynamic variable values before starting a test call from the dashboard.
Concurrency and Rate Limits in Organization Settings: Your call concurrency cap and API request rate limit now appear as read-only fields in Organization Settings.
Fixes and Improvements:
- Cartesia voice overrides in squads now apply correctly instead of falling back to a hard-coded default.
- Duplicate tools sharing the same function.name are de-duplicated during model streaming, preventing duplicate tool calls.
- The call concurrency chart in analytics now renders correctly.

Jun 1, 2026

What's New: Week of June 1, 2026

xAI Speech-to-Text and Text-to-Speech: xAI is now available as a transcriber (STT) and voice (TTS) provider for assistants.
Upgraded Vapi Voices: A new text-to-speech model powering Vapi Voices makes them sound more authentic, human, and consistent — at ~50% lower cost.
- Existing deployments don’t change automatically — opt in by setting version: 2 on the voice configuration via the API or Dashboard. See Vapi Voices for supported voices and audio samples.
Pronunciation Dictionaries in the Dashboard: The Assistants view now supports creating new pronunciation dictionaries directly from the dashboard.
Phone Number Fixes: Improvements to phone number creation and listing.
- The Phone Numbers list no longer breaks on rows with no number or SIP URI.
- Creating a phone number now validates its Vapi identifier up front.

May 25, 2026

What's New: Week of May 25, 2026

Dashboard Performance: Front-end infrastructure improvements for faster page loads and a snappier feel across the dashboard.

May 18, 2026

What's New: Week of May 18, 2026

Responsive UI Polish: A round of UI adjustments to make the app work better across viewport sizes.
New Composer-Based Onboarding Flow: A new onboarding experience built on top of the Assistant Builder and powered by Composer is rolling out to select users as part of a phased release.

May 11, 2026

What's New: Week of May 11, 2026

New Assistant Builder Experience: An updated, streamlined assistant configuration experience, now available to all users.
- UI optimizations for the Phone Numbers page were also made to align its look and interactions with the new experience.

May 4, 2026

What's New: Week of May 4, 2026

Soniox — General Availability: The Soniox transcriber is now rolled out to all customers. Configure it on any assistant via assistant.transcriber (provider: soniox) for low-latency, multilingual real-time speech-to-text.

Apr 27, 2026

What's New: Week of April 27, 2026

Deepgram Flux — Multilingual Support: Full support for Deepgram’s multilingual Flux model. Multilingual agents can now leverage the same smart turn-taking that powers the English Flux transcriber, making cross-lingual conversations feel more fluid and natural.

Apr 20, 2026

What's New: Week of April 20, 2026

Logs UX Refresh: New filter layout plus a round of UX improvements — improved date picker, active row is clearly highlighted across all log views when the flyout is opened, log tables are fully keyboard-accessible, sortable cost and duration columns, pagination, and more.
Squads contextEngineeringPlan Handoff Type — previousAssistantMessages: Forwards only the conversation history from before the current assistant’s session. The current assistant’s own messages and tool calls are excluded entirely from the handoff payload. See the updated handoff context configuration docs.
assistant.speechStarted Event — Live Captions & Word-Level Timing (GA): A new opt-in message fires as the assistant begins speaking each segment, carrying the full turn text, turn, source (model / force-say / custom-voice), and optional timing:
- Per-word alignment on ElevenLabs
- Cursor-based word-progress on Minimax (set voice.subtitleType: "word", with correct CJK handling)
- Text-only fallback on all other providers
Subscribe by adding "assistant.speechStarted" to your assistant’s clientMessages and/or serverMessages — now GA with no feature flag. Use it for live captions, karaoke-style highlighting, or any UI that needs to stay in sync with assistant audio. Fully backward-compatible; no existing messages changed.
Autofallbacks on Transcribers: Let Vapi pick the best transcriber to fall back to if your primary one fails — even mid-call. Opt in by setting assistant.transcriber.fallbackPlan.autoFallback.enabled to true. See the updated transcriber fallback plan docs.

Apr 13, 2026

What's New: Week of April 13, 2026

Monitoring — GA: Automated call quality monitoring is now generally available. Detect issues with trigger-based rules, get alerts when something goes wrong, and surface resolution suggestions — all from the dashboard.
- Monitoring quickstart
- Announcement blog post

Mar 31, 2026

What's New: October 2025 – March 2026

Here’s a summary of major items shipped from October 2025 through March 2026.

Sep 29, 2025

Breaking Changes & API Cleanup

Legacy Endpoint Removal: The following deprecated endpoints have been removed as part of our API modernization effort:
- /logs - Use call artifacts and monitoring instead
- /workflow/{id} - Access workflows through the main workflow endpoints
- /test-suite and related paths - Replaced by the new evaluation system
- /knowledge-base and related paths - Integrated into model configurations
Knowledge Base Architecture Change: The knowledgeBaseId property has been removed from all model configurations. This affects:
- XaiModel, GroqModel, GoogleModel
- OpenAIModel, AnthropicModel, CustomLLMModel
- All other model provider configurations
Transcriber Property Deprecation: AssemblyAITranscriber.wordFinalizationMaxWaitTime and FallbackAssemblyAITranscriber.wordFinalizationMaxWaitTime are now deprecated:
- Use smart endpointing plans for better speech timing control
- More precise conversation flow management
- Enhanced end-of-turn detection capabilities
Schema Path Cleanup: Removed numerous unused schema paths from model configurations to simplify the API structure and improve performance. This cleanup affects internal schema references but doesn’t impact your existing integrations.
New v2 API: We are introducing a new API version v2. These changes are part of our ongoing effort to:
- Simplify the API structure for better developer experience
- Remove redundant and deprecated functionality
- Complete the transition to new evaluation and compliance systems
- Improve API performance and maintainability

Sep 28, 2025

Evaluation Execution & Results Processing

Evaluation Execution Engine: Run comprehensive assistant evaluations with EvalRun and CreateEvalRunDTO. Execute your mock conversations against live assistants and squads to validate performance and behavior in controlled environments.
Multiple Evaluation Models: Choose from various AI models for LLM-as-a-judge evaluation:
- EvalOpenAIModel: GPT models including GPT-4.1, o1-mini, o3, and regional variants
- EvalAnthropicModel: Claude models with optional thinking features for complex evaluations
- EvalGoogleModel: Gemini models from 1.0 Pro to 2.5 Pro for diverse evaluation needs
- EvalGroqModel: High-speed inference models including Llama and custom options
- EvalCustomModel: Your own evaluation models with custom endpoints
Evaluation Results: Comprehensive result tracking with EvalRunResult:
- status: Pass/fail evaluation outcomes
- messages: Complete conversation transcript from the evaluation
- startedAt and endedAt: Precise timing information for performance analysis
Target Flexibility: Run evaluations against different targets:
- EvalRunTargetAssistant: Test individual assistants with optional overrides
- EvalRunTargetSquad: Evaluate entire squad performance and coordination
Evaluation Status Tracking: Monitor evaluation progress with detailed status information:
- running: Evaluation in progress
- ended: Evaluation completed
- queued: Evaluation waiting to start
- Detailed endedReason including success, error, timeout, and cancellation states
Judge Configuration: Optimize evaluation accuracy with model-specific settings:
- maxTokens: Recommended 50-10000 tokens (1 token for simple pass/fail responses)
- temperature: 0-0.3 recommended for LLM-as-a-judge to reduce hallucinations

Sep 26, 2025

Voicemail Detection & Handling Improvements

Enhanced Beep Detection: Improve voicemail detection accuracy with CreateVoicemailToolDTO.beepDetectionEnabled specifically for Twilio-based calls. This feature detects the characteristic beep sound that indicates voicemail recording has started.
Workflow Voicemail Integration: Configure comprehensive voicemail handling in workflows with enhanced message and detection capabilities:
- Workflow.voicemailMessage: Custom messages for voicemail scenarios (up to 1000 characters)
- Workflow.voicemailDetection: Configurable detection methods for different providers
Assistant Voicemail Enhancement: Improved voicemail handling in assistant configurations with Assistant.voicemailMessage and Assistant.voicemailDetection for consistent behavior across all conversation types.
Multiple Detection Methods: Choose from various voicemail detection providers:
- Google: GoogleVoicemailDetectionPlan for AI-powered detection
- OpenAI: OpenAIVoicemailDetectionPlan for intelligent voicemail recognition
- Twilio: TwilioVoicemailDetectionPlan for carrier-level detection
- Vapi: VapiVoicemailDetectionPlan for integrated detection
Beep Detection for Call Flows: The new beep detection capability works specifically with Twilio transport, providing reliable voicemail identification when traditional detection methods may not be sufficient.
Voicemail Tool Configuration: Enhanced tool rejection and messaging capabilities ensure appropriate handling when voicemail is detected, with configurable responses based on your business requirements.

Sep 23, 2025

Advanced Analytics & Variable Grouping

Variable Value Analytics: Gain deeper insights into your assistant performance with AnalyticsQuery.groupByVariableValue. Group analytics data by specific variable values extracted during calls for granular performance analysis.
Enhanced Grouping Options: Use VariableValueGroupBy to specify custom grouping criteria:
- key: The variable value key to group by (up to 100 characters)
- Combine with existing grouping options like assistantId, endedReason, and status
Multi-Dimensional Analysis: Create complex analytics queries by combining traditional grouping fields with variable values:
- Group by assistant performance AND custom business metrics
- Analyze conversation outcomes by extracted data points
- Track success rates across different variable value segments
Advanced Query Capabilities: Enhanced AnalyticsQuery functionality enables sophisticated data analysis:
- Multiple grouping dimensions for comprehensive insights
- Variable-based segmentation for business intelligence
- Custom metric tracking through extracted call variables
Business Intelligence Integration: Connect your call data to business outcomes by grouping analytics on:
- Customer satisfaction scores extracted from calls
- Product interest levels determined during conversations
- Lead qualification status gathered through assistant interactions
- Custom KPIs specific to your business logic

Sep 20, 2025

Chat Transport & SMS Integration

Twilio SMS Transport: Send chat responses directly via SMS using TwilioSMSChatTransport in CreateChatDTO.transport. This enables programmatic SMS conversations with your voice assistants, bridging the gap between voice and text communication.
SMS Session Management: Create new sessions automatically when using SMS transport by providing:
- customer: Customer information for SMS delivery
- phoneNumberId: SMS-enabled phone number from your organization
- Automatic session creation when both fields are provided
LLM-Generated vs Direct SMS: Control message processing with TwilioSMSChatTransport.useLLMGeneratedMessageForOutbound:
- true (default): Input processed by assistant for intelligent responses
- false: Direct message forwarding without LLM processing for notifications and alerts
Enhanced Chat Creation: CreateChatDTO now supports sophisticated session management:
- transport: SMS delivery configuration
- sessionId: Use existing session data
- Mutual exclusivity between sessionId and transport fields for clear session boundaries
OpenAI Responses Integration: Streamlined chat processing with OpenAIResponsesRequest supporting the same transport and squad integration features for consistent API experience.
Cross-Platform Continuity: Seamlessly transition between voice calls and SMS conversations within the same session, maintaining context and conversation history across communication channels.

Sep 17, 2025

API Versioning & Infrastructure Updates

API Version 2 Introduction: Access enhanced functionality through new versioned endpoints while maintaining full backward compatibility:
- /v2/call: Enhanced call management with new features and improved response formats
- /v2/phone-number: Advanced phone number management with extended capabilities
Enhanced Pagination: Improved pagination controls across all endpoints with PaginationMeta enhancements:
- createdAtGe and createdAtLe: Date range filtering for creation timestamps
- Better sorting and filtering options for large datasets
- Enhanced metadata for pagination state management
Workflow Message Configuration: Customize voicemail handling in workflows with CreateWorkflowDTO.voicemailMessage and CreateWorkflowDTO.voicemailDetection for comprehensive call flow management.
Credential Integration: Seamless credential management across all workflow and assistant configurations with enhanced credentials.items.discriminator.mapping.custom-credential support.
Transport Infrastructure: Foundation for advanced communication channels with improved transport configuration and management capabilities.