What’s New: Week of April 20, 2026
What’s New: Week of April 20, 2026
-
Logs UX Refresh: New filter layout plus a round of UX improvements — improved date picker, active row is clearly highlighted across all log views when the flyout is opened, log tables are fully keyboard-accessible, sortable
costanddurationcolumns, pagination, and more. -
Squads
contextEngineeringPlanHandoff Type —previousAssistantMessages: Forwards only the conversation history from before the current assistant’s session. The current assistant’s own messages and tool calls are excluded entirely from the handoff payload. See the updated handoff context configuration docs. -
assistant.speechStartedEvent — Live Captions & Word-Level Timing (GA): A new opt-in message fires as the assistant begins speaking each segment, carrying the full turn text,turn,source(model/force-say/custom-voice), and optional timing:- Per-word alignment on ElevenLabs
- Cursor-based word-progress on Minimax (set
voice.subtitleType: "word", with correct CJK handling) - Text-only fallback on all other providers
Subscribe by adding
"assistant.speechStarted"to your assistant’sclientMessagesand/orserverMessages— now GA with no feature flag. Use it for live captions, karaoke-style highlighting, or any UI that needs to stay in sync with assistant audio. Fully backward-compatible; no existing messages changed. -
Autofallbacks on Transcribers: Let Vapi pick the best transcriber to fall back to if your primary one fails — even mid-call. Opt in by setting
assistant.transcriber.fallbackPlan.autoFallback.enabledtotrue. See the updated transcriber fallback plan docs.
What’s New: Week of April 13, 2026
What’s New: Week of April 13, 2026
-
Monitoring — GA: Automated call quality monitoring is now generally available. Detect issues with trigger-based rules, get alerts when something goes wrong, and surface resolution suggestions — all from the dashboard.
What’s New: October 2025 – March 2026
What’s New: October 2025 – March 2026
Here’s a summary of major items shipped from October 2025 through March 2026.
Platform
-
Squads v2: Visual builder to simplify sophisticated multi-assistant orchestration with seamless handoffs between specialized agents.
-
Composer (Alpha): Intelligent assistant inside the dashboard that allows you to describe what you need through plain text prompts to help build, adjust, and debug voice agents.
-
Simulations (Alpha): Voice agent testing feature to build confidence through enabling systematic, AI-powered testing in specific scenarios with evaluation of outcomes.
-
Monitoring & Issues: Automated call quality monitoring with trigger-based issue detection, alerting, and resolution suggestions.
-
HIPAA with Data Retention: New compliance mode with private storage and in-dashboard toggle/purchase flow — available for additional cost.
-
Zero Data Retention: Compliance mode that keeps context data during call as needed to execute tasks and retains no data afterwards.
-
Consolidated Logs: Unified log viewing into a single page.
-
Vapi Voices: 12 new ultra-realistic voices released, optimized for latency and cost with adjustable speed controls exposed. 8 legacy voices deprecated.
New Models & Provider Support
Transcriber Models (Speech-to-Text)
-
Deepgram Nova-3 Languages: Added Hebrew, Urdu, Tagalog, and Arabic bilingual support.
-
Cartesia Transcriber: ink-whisper.
-
Soniox: stt-rt-v4.
Intelligence Models (LLM)
-
GPT-5 Family: OpenAI’s latest intelligence models, including GPT-5, 5-Mini, 5-Nano, 5.1, 5.2, 5.4, 5.4-Mini, 5.4-Nano.
-
Claude 4.5–4.6: Anthropic’s latest intelligence models Sonnet 4.5, Opus 4.5, Opus 4.6, Sonnet 4.6.
-
Gemini 3 Flash: Google’s latest intelligence models.
-
Grok 4 Fast: Reasoning and non-reasoning variants.
-
GPT Realtime Mini: OpenAI’s lightweight realtime model.
Voice Models (Text-to-Speech)
-
Cartesia: sonic-3, sonic-3-2026-01-12, sonic-3-2025-10-27.
-
WellSaid: Caruso (new), legacy.
-
Inworld: inworld-tts-1 (REST, original), inworld-tts-1.5-max (WebSocket, $10/M chars), inworld-tts-1.5-mini (WebSocket, $5/M chars).
-
ElevenLabs Scribe v2: Latest version of ElevenLabs speech-to-text.
Developer Tools & API
-
Structured Outputs Improvements: Updates to our AI-powered analysis and data extraction tool, including transient structured outputs, audio-based extraction, and regex extraction.
-
SIP Request Tool + DTMF over SIP INFO: Send SIP requests and DTMF tones via SIP INFO messages during calls.
-
Variable Passing Between Tool Calls: Pass output variables from one tool call as input to subsequent tool calls.
-
Encrypted Tool Arguments: Encrypt sensitive tool arguments to protect data in transit.
-
Low Confidence Speech Hook: Hook that triggers when the transcriber returns low-confidence speech results.
-
Time Elapsed Hook: Hook that triggers at specified time intervals during a call.
-
assistant.speechStarted Event: New event fired when the assistant begins speaking.
-
MCP Improvements: Bearer auth, $ref dereferencing, child tool messages/discovery.
-
Warm Transfer Improvements: SIP support, caller ID, context engineering, variable filling.
Breaking Changes & API Cleanup
Breaking Changes & API Cleanup
-
Legacy Endpoint Removal: The following deprecated endpoints have been removed as part of our API modernization effort:
/logs- Use call artifacts and monitoring instead/workflow/{id}- Access workflows through the main workflow endpoints/test-suiteand related paths - Replaced by the new evaluation system/knowledge-baseand related paths - Integrated into model configurations
-
Knowledge Base Architecture Change: The
knowledgeBaseIdproperty has been removed from all model configurations. This affects:XaiModel,GroqModel,GoogleModelOpenAIModel,AnthropicModel,CustomLLMModel- All other model provider configurations
-
Transcriber Property Deprecation:
AssemblyAITranscriber.wordFinalizationMaxWaitTimeandFallbackAssemblyAITranscriber.wordFinalizationMaxWaitTimeare now deprecated:- Use smart endpointing plans for better speech timing control
- More precise conversation flow management
- Enhanced end-of-turn detection capabilities
-
Schema Path Cleanup: Removed numerous unused schema paths from model configurations to simplify the API structure and improve performance. This cleanup affects internal schema references but doesn’t impact your existing integrations.
-
New v2 API: We are introducing a new API version v2. These changes are part of our ongoing effort to:
- Simplify the API structure for better developer experience
- Remove redundant and deprecated functionality
- Complete the transition to new evaluation and compliance systems
- Improve API performance and maintainability
For details on the new features that replace these deprecated endpoints, see our recent changelog entries:
- Enhanced Authentication & Custom Credentials (Aug 30)
- Recording Consent & Compliance Management (Sep 2)
- Evaluation System Foundation (Sep 5)
- Evaluation Execution & Results Processing (Sep 28)
If you’re currently using any of the removed endpoints or properties, you must migrate to the new alternatives before this release. Contact support if you need assistance with migration strategies.
Migration Guide
Replace /logs endpoint usage with call artifacts, monitoring plans, and end-of-call reports for comprehensive logging.
Migrate from test-suite endpoints to the new evaluation system with mock conversations and comprehensive result tracking.
Update model configurations to use the integrated knowledge base system instead of separate knowledgeBaseId references.
Replace deprecated transcriber timing properties with smart endpointing plans for better conversation flow control.
Removed Endpoints
The following endpoints are no longer available:
GET /logs- Use call artifacts insteadGET /workflow/{id}- Use main workflow endpointsGET /test-suite,POST /test-suite- Use evaluation endpointsGET /test-suite/{id},PUT /test-suite/{id},DELETE /test-suite/{id}- Use evaluation managementPOST /test-suite/{testSuiteId}/run- Use evaluation runsGET /knowledge-base,POST /knowledge-base- Integrated into model configurations- All related nested endpoints and operations
See Also:
- Authentication System Updates (Aug 30) - For credential management migration
- Recording Consent Features (Sep 2) - For compliance system details
- Enhanced Transcription (Sep 8) - For AssemblyAI timing alternatives
Evaluation Execution & Results Processing
Evaluation Execution & Results Processing
-
Evaluation Execution Engine: Run comprehensive assistant evaluations with
EvalRunandCreateEvalRunDTO. Execute your mock conversations against live assistants and squads to validate performance and behavior in controlled environments. -
Multiple Evaluation Models: Choose from various AI models for LLM-as-a-judge evaluation:
EvalOpenAIModel: GPT models including GPT-4.1, o1-mini, o3, and regional variantsEvalAnthropicModel: Claude models with optional thinking features for complex evaluationsEvalGoogleModel: Gemini models from 1.0 Pro to 2.5 Pro for diverse evaluation needsEvalGroqModel: High-speed inference models including Llama and custom optionsEvalCustomModel: Your own evaluation models with custom endpoints
-
Evaluation Results: Comprehensive result tracking with
EvalRunResult:status: Pass/fail evaluation outcomesmessages: Complete conversation transcript from the evaluationstartedAtandendedAt: Precise timing information for performance analysis
-
Target Flexibility: Run evaluations against different targets:
EvalRunTargetAssistant: Test individual assistants with optional overridesEvalRunTargetSquad: Evaluate entire squad performance and coordination
-
Evaluation Status Tracking: Monitor evaluation progress with detailed status information:
running: Evaluation in progressended: Evaluation completedqueued: Evaluation waiting to start- Detailed
endedReasonincluding success, error, timeout, and cancellation states
-
Judge Configuration: Optimize evaluation accuracy with model-specific settings:
maxTokens: Recommended 50-10000 tokens (1 token for simple pass/fail responses)temperature: 0-0.3 recommended for LLM-as-a-judge to reduce hallucinations
For LLM-as-a-judge evaluations, the judge model must respond with exactly “pass” or “fail”. Design your evaluation prompts to ensure clear, deterministic responses.
Evaluation Capabilities
Choose from OpenAI, Anthropic, Google, Groq, or custom models for evaluation, matching your quality and performance requirements.
Detailed pass/fail results with complete conversation transcripts and timing information for thorough analysis.
Test individual assistants or entire squads with optional configuration overrides for comprehensive validation.
Real-time evaluation status tracking with detailed reason codes for failures, timeouts, and cancellations.
Voicemail Detection & Handling Improvements
Voicemail Detection & Handling Improvements
-
Enhanced Beep Detection: Improve voicemail detection accuracy with
CreateVoicemailToolDTO.beepDetectionEnabledspecifically for Twilio-based calls. This feature detects the characteristic beep sound that indicates voicemail recording has started. -
Workflow Voicemail Integration: Configure comprehensive voicemail handling in workflows with enhanced message and detection capabilities:
Workflow.voicemailMessage: Custom messages for voicemail scenarios (up to 1000 characters)Workflow.voicemailDetection: Configurable detection methods for different providers
-
Assistant Voicemail Enhancement: Improved voicemail handling in assistant configurations with
Assistant.voicemailMessageandAssistant.voicemailDetectionfor consistent behavior across all conversation types. -
Multiple Detection Methods: Choose from various voicemail detection providers:
- Google:
GoogleVoicemailDetectionPlanfor AI-powered detection - OpenAI:
OpenAIVoicemailDetectionPlanfor intelligent voicemail recognition - Twilio:
TwilioVoicemailDetectionPlanfor carrier-level detection - Vapi:
VapiVoicemailDetectionPlanfor integrated detection
- Google:
-
Beep Detection for Call Flows: The new beep detection capability works specifically with Twilio transport, providing reliable voicemail identification when traditional detection methods may not be sufficient.
-
Voicemail Tool Configuration: Enhanced tool rejection and messaging capabilities ensure appropriate handling when voicemail is detected, with configurable responses based on your business requirements.
Beep detection is currently available only for Twilio-based calls. If you’re using other providers, consider combining multiple detection methods for better accuracy.
Voicemail Management Features
Support for Google, OpenAI, Twilio, and Vapi detection methods, allowing you to choose the best option for your use case.
Advanced audio analysis to detect voicemail beeps on Twilio calls for more reliable voicemail identification.
Configure personalized voicemail messages up to 1000 characters for better user experience and brand consistency.
Comprehensive voicemail handling throughout workflow nodes with consistent configuration across conversation flows.
Advanced Analytics & Variable Grouping
Advanced Analytics & Variable Grouping
-
Variable Value Analytics: Gain deeper insights into your assistant performance with
AnalyticsQuery.groupByVariableValue. Group analytics data by specific variable values extracted during calls for granular performance analysis. -
Enhanced Grouping Options: Use
VariableValueGroupByto specify custom grouping criteria:key: The variable value key to group by (up to 100 characters)- Combine with existing grouping options like
assistantId,endedReason, andstatus
-
Multi-Dimensional Analysis: Create complex analytics queries by combining traditional grouping fields with variable values:
- Group by assistant performance AND custom business metrics
- Analyze conversation outcomes by extracted data points
- Track success rates across different variable value segments
-
Advanced Query Capabilities: Enhanced
AnalyticsQueryfunctionality enables sophisticated data analysis:- Multiple grouping dimensions for comprehensive insights
- Variable-based segmentation for business intelligence
- Custom metric tracking through extracted call variables
-
Business Intelligence Integration: Connect your call data to business outcomes by grouping analytics on:
- Customer satisfaction scores extracted from calls
- Product interest levels determined during conversations
- Lead qualification status gathered through assistant interactions
- Custom KPIs specific to your business logic
Variable values are extracted during calls using tool response schemas and aliases. Set up variable extraction in your tools to enable powerful analytics grouping based on conversation outcomes.
Analytics Enhancements
Group analytics by any variable extracted during calls, enabling business-specific performance insights and KPI tracking.
Combine traditional call metrics with custom variable grouping for comprehensive conversation analysis.
Connect call performance to business outcomes through variable-based analytics and custom grouping options.
Create detailed reports by grouping on extracted conversation data like satisfaction scores, intent categories, or custom business metrics.
Chat Transport & SMS Integration
Chat Transport & SMS Integration
-
Twilio SMS Transport: Send chat responses directly via SMS using
TwilioSMSChatTransportinCreateChatDTO.transport. This enables programmatic SMS conversations with your voice assistants, bridging the gap between voice and text communication. -
SMS Session Management: Create new sessions automatically when using SMS transport by providing:
customer: Customer information for SMS deliveryphoneNumberId: SMS-enabled phone number from your organization- Automatic session creation when both fields are provided
-
LLM-Generated vs Direct SMS: Control message processing with
TwilioSMSChatTransport.useLLMGeneratedMessageForOutbound:true(default): Input processed by assistant for intelligent responsesfalse: Direct message forwarding without LLM processing for notifications and alerts
-
Enhanced Chat Creation:
CreateChatDTOnow supports sophisticated session management:transport: SMS delivery configurationsessionId: Use existing session data- Mutual exclusivity between
sessionIdand transport fields for clear session boundaries
-
OpenAI Responses Integration: Streamlined chat processing with
OpenAIResponsesRequestsupporting the same transport and squad integration features for consistent API experience. -
Cross-Platform Continuity: Seamlessly transition between voice calls and SMS conversations within the same session, maintaining context and conversation history across communication channels.
SMS transport requires SMS-enabled phone numbers in your organization. The phone number must support SMS functionality and belong to your account for successful message delivery.
SMS Communication Features
Send and receive SMS messages through your voice assistant, enabling text-based interactions alongside voice conversations.
Choose between AI-processed responses and direct message forwarding based on your use case requirements.
Maintain conversation context across SMS and voice interactions within unified sessions for seamless user experiences.
Automatic session creation and management when using transport fields, simplifying SMS conversation setup.
API Versioning & Infrastructure Updates
API Versioning & Infrastructure Updates
-
API Version 2 Introduction: Access enhanced functionality through new versioned endpoints while maintaining full backward compatibility:
/v2/call: Enhanced call management with new features and improved response formats/v2/phone-number: Advanced phone number management with extended capabilities
-
Enhanced Pagination: Improved pagination controls across all endpoints with
PaginationMetaenhancements:createdAtGeandcreatedAtLe: Date range filtering for creation timestamps- Better sorting and filtering options for large datasets
- Enhanced metadata for pagination state management
-
Workflow Message Configuration: Customize voicemail handling in workflows with
CreateWorkflowDTO.voicemailMessageandCreateWorkflowDTO.voicemailDetectionfor comprehensive call flow management. -
Credential Integration: Seamless credential management across all workflow and assistant configurations with enhanced
credentials.items.discriminator.mapping.custom-credentialsupport. -
Transport Infrastructure: Foundation for advanced communication channels with improved transport configuration and management capabilities.
Version 2 endpoints provide enhanced features while v1 endpoints remain fully functional. Migrate to v2 when you need access to new capabilities or improved performance characteristics.
Infrastructure Improvements
Existing v1 endpoints continue to work unchanged, ensuring smooth transitions and zero downtime for existing integrations.
Improved date range filtering and pagination controls for better data management and API performance.
Enhanced workflow configuration with better voicemail handling and credential management throughout the call flow.
Foundation for advanced features and capabilities that will be built on the v2 API structure.
Squad Management & Session Enhancement
Squad Management & Session Enhancement
-
Squad-Based Sessions: Organize your assistants into collaborative teams with
Session.squadandSession.squadId. Sessions can now be associated with squads for team-based conversation management and coordinated assistant behavior. -
Squad Chat Integration: Enable squad-based chat conversations using
Chat.squadandChat.squadId. This allows multiple assistants to participate in or be aware of chat contexts for more sophisticated conversation handling. -
Enhanced Session Creation: Create squad-enabled sessions with
CreateSessionDTO.squadandCreateSessionDTO.squadId, enabling persistent conversation contexts across multiple assistants and interaction types. -
Chat Management by Squad: Filter and organize chats by squad membership using
GetChatPaginatedDTO.squadIdfor better conversation management and team-based analytics. -
Session Management by Squad: Query sessions by squad association with
GetSessionPaginatedDTO.squadId, providing team-based session organization and management capabilities. -
Full Message History: Control conversation context retention with
ArtifactPlan.fullMessageHistoryEnabled. When enabled, artifacts contain complete message history even after handoff context engineering, preserving full conversation flow for analysis. -
Transfer Records: Track warm transfer details with
Artifact.transfers, providing comprehensive records of transfer destinations, transcripts, and status information for multi-assistant conversations.
Squad management enables sophisticated multi-assistant workflows where different specialists can handle different parts of a conversation while maintaining shared context and coordination.
Team Collaboration Features
Enable multiple assistants to work together within squads for specialized conversation handling and seamless handoffs.
Maintain conversation context across squad members and session boundaries for continuous conversation experiences.
Filter conversations, sessions, and analytics by squad membership for team-based performance insights and management.
Track all transfers and handoffs with detailed records including destinations, transcripts, and status information.