Create web-based chat sessions with your assistants using the new Web Chat
integration with OpenAI Web Chat Requests
. Accept user input as strings or message arrays and manage conversations with session and customer information.
- Inworld TTS Voice Provider Integration: You can now customize which language
Inworld Voices
use likeEnglish
,Chinese
, andKorean
. You can also set the TTSmodel
and toggle voice caching withcachingEnabled
.

-
Additional Customer Information Fields: You can now include
email
andexternalId
fields when creating customers. You can also disable the E164 number format check withnumberE164CheckEnabled
– setting it tofalse
lets you use non-E164 numbers like1234
orabc
, useful for dialing non-standard numbers on SIP trunks. This lets you store extra contact information and link customers to external systems. -
schedulePlan
No Longer Required in Campaigns: You can now create campaigns without specifying aschedulePlan
.
Behavior Change: The Chat.assistantOverrides
property now only supports variable substitution in chat contexts, limiting its functionality compared to previous versions.
Custom Models, Enhanced Campaigns, and MCP Tool Improvements
Bring your own hosted LLMs and Google Gemini models to workflows with new WorkflowCustomModel
and WorkflowGoogleModel
objects. Control payload structure for advanced integrations and expand your model choices beyond OpenAI.

Gain deeper insight into campaign performance with new call counters including callsCounterQueued
, callsCounterScheduled
, callsCounterInProgress
, and callsCounterEndedVoicemail
for comprehensive campaign tracking.
Flexible tool integrations with new McpToolMetadata
field. Select between Server-Sent Events (sse
) or Streamable HTTP (shttp
) protocols for tool communication.
**Create support tickets directly through Vapi ** using the new /support/ticket
endpoint, simplifying how you request assistance.
-
Multilingual LMNT Voice Support: The
LMNTVoice
andFallbackLMNTVoice
objects now support alanguage
property (ISO 639-1 orauto
) for selecting or auto-detecting spoken language in synthesized voices. -
Assistant Overrides in Chats: The
assistantOverrides
property is now available inChat
,CreateChatDTO
, andOpenAIResponsesRequest
, allowing you to dynamically override assistant settings and template variables per chat session. -
New API Endpoints and Objects: Added
POST /workflow/generate
endpoint for workflow generation with tool IDs, plus new objects includingGenerateWorkflowDTO
and enhancedCreateMcpToolDTO
/UpdateMcpToolDTO
with metadata support. -
Include Messages in Server Response from Transfer Requests: When transferring calls, you can now include a
message
to communicate with users during the process withServerMessageResponse.message.message
.
Breaking Change: The 'aws-sts'
type is no longer supported in OAuth2AuthenticationPlan
. If you’re currently using this type in your OAuth2 authentication configurations, you’ll need to update it to avoid errors.
You can now use Inworld as a voice provider by selecting Inworld
in your configuration. You can also route your InWorld credentials under Settings > Integrations. Finally, there are new Call.endedReason
codes to help you better understand why calls ended due to Inworld voice issues.

-
HMAC Authentication for Webhook Credentials: Secure your webhooks with HMAC authentication by configuring
Assistant.credentials.authenticationPlan
withHMACAuthenticationPlan
, providing an alternative to OAuth2. -
Detailed Call End Reasons for Inworld Voice: New
endedReason
codes provide more insight when calls end due to Inworld voice issues.
Breaking Change: The codeSwitchingEnabled
property has been removed from Deepgram transcribers. If you’re currently using this property in your Deepgram transcriber configurations, you’ll need to remove it to avoid errors.
Org Concurrency Limit Deprecated: The concurrencyLimit
field in Org
, CreateOrgDTO
, and UpdateOrgDTO
is now marked as deprecated.
New Campaigns APIs and Assistant Improvements
Create, retrieve, and manage campaigns using the new /campaign
endpoints. Build automated call campaigns with specified customers and schedules.
General Availability: Assistant.modelOutputInMessagesEnabled
is now generally available without beta limitations. You can decide whether to use the model’s output in conversation history instead of the assistant’s speech transcription.
-
Simplified Assistant Property Structure: Properties like
serverMessages
,clientMessages
, andserverUrl
have been moved underAssistant.monitorPlan
. This reorganization simplifies how you configure monitoring for your assistants. -
Node-Level Overrides for Model and Voice: In
Conversation Node
, properties likemodel
,voice
, andtranscriber
now explicitly override the workflow’s settings. This allows you to customize these settings for individual nodes within a workflow for greater control. -
Enhanced Credential Configuration in Assistants: Assistants now support
credentials
andcredentialIds
, similar to workflows. This allows you to specify dynamic credentials specifically for assistant calls, enhancing security and flexibility. -
New Models Available in
ConversationNode
: You can now useGoogle Models
andCustom Models
in conversation nodes. This expands the range of language models that can be integrated into conversation nodes.

Workflow Configuration Enhancements and Assistant Updates
-
Workflow-Level Configuration of Voice and Plans: Developers can now configure
voice
,transcriber
, and various plans likemonitorPlan
andartifactPlan
at the workflow level. These configurations can still be overridden at the node level if needed. -
Use of Dynamic Credentials in Workflows: Workflows now support
credentials
andcredentialIds
, allowing you to specify dynamic credentials for workflow calls. This offers more flexibility in credential management, enabling credentials to be tied directly to specific workflows. -
backgroundDenoisingEnabled
Deprecated in Assistants: ThebackgroundDenoisingEnabled
property in Assistant is now deprecated. You should use the newAssistant.backgroundSpeechDenoisingPlan
to configure advanced background noise and speech denoising features.
Overview
- API Request Tool: You can now create API request tools that allow the assistant to make REST API calls during conversations

- Specify GCP Region: You can now specify the region for your GCP Credentials, This gives you control over where your call artifacts are stored.

- Transcriber Formatting Option: A new
formatTurns
option in yourAssembly AI Transcriber
that lets you enable or disable formatting of transcripts when using AssemblyAI’s Universal Streaming API. This helps you format transcript outputs to show speaker turns.
New Model Selection, Enhanced Edge Conditions, Simplified Credentials, and More
-
New Model Selection in Workflows: You can now specify the AI model used in workflows by setting the
model
property in workflow schemas. This allows choosing between OpenAI, Anthropic, Google, or custom models to better suit application requirements. -
Enhanced Workflow Edge Conditions: Workflows now support
Logic Edge Conditions
andFailed Edge Conditions
for edges. Specify logic edge conditions with Liquid JS templates to enable more complex logic and error handling within workflows, allowing for dynamic and responsive workflow designs. -
Simplified Credential Configuration: Your uploaded credentials are now automatically configured with the correct fallback index, simplifying the setup process with cloud providers.
-
Updated End Reasons for ElevenLabs: The following
endedReason
values been removed fromCall
:pipeline-error-eleven-labs-503-server-error
call.in-progress.error-providerfault-eleven-labs-503-server-error
You should update your error handling code to reflect the current set of possible end reasons.
Prompt Length Limitations: The globalPrompt
in workflows now has a maximum length of 5000 characters, and the liquid
property in LogicEdgeCondition
now has a maximum length of 1000 characters. Ensure prompts and conditions stay within these limits to prevent errors.
New Storage Credentials Providers
- New Storage Provider Credentials Added: You can now use new credential types
S3Credential
,GcpCredential
,AzureCredential
,SupabaseCredential
, andCloudflareCredential
to integrate with various storage services. This expands your options for storing data seamlessly across different providers.
Access to chat Object in Server Messages
- Access to
chat
Object in Server Messages: You can now access thechat
object within various server messages, providing additional context about the conversation.
Background Speech Denoising, Cartesia Transcriber, Workflow Enhancements, and Call Error Handling
- Background Speech Denoising Plan: You can now enhance call quality by configuring advanced background speech denoising options using the new
assistant.backgroundSpeechDenoisingPlan.smartDenoisingPlan
(default:false
), which replaces the previousbackgroundDenoisingEnabled
setting.
Use the SmartDenoisingPlan
to filter out background speech and noise using Krisp technology.
Fine-tune noise reduction with the new FourierDenoisingPlan
for more control over audio clarity.
Smart and Fourier denoising can be combined for optimal results. Order of precedence: Smart denoising, then Fourier denoising.
-
Workflow Server Property: Workflows now support a
server
property, allowing you to specify a server URL to receive webhook callbacks for workflow events directly. -
New Workflow Models: You can now integrate Google’s LLMs or custom models into your workflows by specifying
Google
orCustom LLM
in your workflow model settings. Select your model under Model Settings

-
Enhanced Error Reporting for Cartesia Services: A new
endedReason
valuepipeline-error-cartesia-502-server-error
has been added to help you identify and handle specific errors related to Cartesia server issues. -
Enhanced Error Handling and Status Enums: We’ve added new error enums and status codes to help you better handle and debug call-related issues:
-
VAPI Fault Errors: Detect specific VAPI-related errors during call start using
call.start.error-vapifault-get-org
andcall.start.error-vapifault-get-subscription
-
Subscription Status Errors: Identify subscription-related issues with new enums:
call.start.error-subscription-frozen
(replacesunknown-error
)call.start.error-subscription-insufficient-credits
-
Call Completion Statuses: Track how calls are completed with new enums:
call.in-progress.twilio-completed-call
call.in-progress.sip-completed-call
-
In-Call Error Detection: Handle specific errors during active calls using enums like
call.in-progress.error-vapifault-chat-pipeline-failed-to-start
-