- Azure Region Renamed to `swedencentral` (from `sweden`): Azure Speech Services customers using the Sweden data center should now specify `swedencentral` as their Azure Speech Services region instead of `sweden`. Update the region in your code and on the updated provider keys page > Azure Speech.
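If the region string is stored in your own configuration, the rename can be applied in one place. The following is only a sketch: the `REGION_RENAMES` table, the `normalize_region` helper, and the `config` dictionary shape are hypothetical and not part of the Vapi API; only the `sweden` and `swedencentral` values come from the entry above.

```python
# Hypothetical lookup of retired Azure Speech region names and their replacements.
REGION_RENAMES = {"sweden": "swedencentral"}


def normalize_region(region: str) -> str:
    """Return the current region name; regions without a rename pass through unchanged."""
    return REGION_RENAMES.get(region, region)


# Hypothetical config shape, used only to illustrate the update.
config = {"region": "sweden"}
config["region"] = normalize_region(config["region"])
```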
Changelog
- Removal of `'gemma-7b-it'` from `GroqModel` Options: The `'gemma-7b-it'` model is no longer available when selecting Groq as a model provider. Update your applications to use other valid options provided by the API.
Refer to the `GroqModel` schema or the Vapi dashboard for Groq for a list of supported models.
- Azure Speech Transcriber Support: You can now use Azure’s speech-to-text service by specifying `AzureSpeechTranscriber` as an option for `transcriber`. This allows you to leverage Azure’s speech-to-text capabilities when creating or updating your assistant.
Refer to our API docs to learn more.
- Use OpenAI Chat Completions in your Assistant: You can now more easily integrate your Assistant with OpenAI’s chat completions sessions by specifying `messages` (an array of `OpenAIMessage` objects) and an `assistantId` (a string). Each `OpenAIMessage` in turn consists of a `content` (a string between 1 and 100,000,000 characters) and a `role` (one of `assistant`, `function`, `user`, `system`, or `tool`). This makes it easier to manage chat sessions associated with a specific assistant. Refer to the `ChatDTO` and `OpenAIMessage` schemas in our API docs to learn more.
- Update Subscription Email on Billing Page: You can now customize which email address appears on your Vapi invoices through the updated billing page > under payment history. You can specify an email address (in addition to a physical address and tax ID). Read more in our docs.
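As a rough illustration of the `messages` and `assistantId` fields described in the chat completions entry, the sketch below builds a request body and checks the documented limits on `content` and `role`. The helper names `make_message` and `make_chat_body` and the placeholder assistant ID are hypothetical; the endpoint and transport are deliberately left out because the entry does not specify them.

```python
from typing import Dict, List

# Limits taken from the changelog entry for OpenAIMessage.
ALLOWED_ROLES = {"assistant", "function", "user", "system", "tool"}
MAX_CONTENT_CHARS = 100_000_000


def make_message(role: str, content: str) -> Dict[str, str]:
    """Build one OpenAIMessage-shaped dict, rejecting values outside the documented limits."""
    if role not in ALLOWED_ROLES:
        raise ValueError(f"unsupported role: {role!r}")
    if not 1 <= len(content) <= MAX_CONTENT_CHARS:
        raise ValueError("content must be between 1 and 100,000,000 characters")
    return {"role": role, "content": content}


def make_chat_body(assistant_id: str, messages: List[Dict[str, str]]) -> Dict[str, object]:
    """Combine an assistant ID and messages into a ChatDTO-shaped body (hypothetical helper)."""
    return {"assistantId": assistant_id, "messages": messages}


# "your-assistant-id" is a placeholder, not a real identifier.
body = make_chat_body("your-assistant-id", [make_message("user", "Hello")])
```

Validating on the client side is optional; it only surfaces obvious mistakes before a request is sent.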
- Claude Computer Use Tools Available: You can now use Claude computer use tools like `BashTool`, `ComputerTool`, and `TextEditorTool` when building your Vapi assistant. Create these tools with `CreateBashToolDTO` (enables shell command execution), `CreateComputerToolDTO` (use desktop functionality with customizable display dimensions using `displayWidthPx`, `displayHeightPx`), and `CreateTextEditorToolDTO` (text editing operations), respectively.
Refer to our API docs to learn more about how to use Claude computer use tools.
- Improved Tavus Video Processing Error Messages: Your call `endedReason` now includes detailed error messages for `pipeline-error-tavus-video-failed`. Use this to detect and manage scenarios where the Tavus video processing pipeline fails during a call.
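One way to detect this case in your own code is sketched below. It assumes the call object is available as a dictionary with an `endedReason` key and that the value equals or begins with the error code; both the dictionary shape and the helper name `is_tavus_video_failure` are assumptions for illustration.

```python
# Error code quoted from the changelog entry.
TAVUS_VIDEO_FAILED = "pipeline-error-tavus-video-failed"


def is_tavus_video_failure(call: dict) -> bool:
    """Return True when the call's endedReason reports a Tavus video pipeline failure."""
    reason = call.get("endedReason") or ""
    # startswith covers both an exact match and a code followed by extra detail.
    return reason.startswith(TAVUS_VIDEO_FAILED)
```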
- OAuth 2 Authentication for Custom LLM Models and Webhooks: In addition to [AuthZ](https://www.okta.com/identity-101/authentication-vs-authorization/), you can now authenticate users accessing your custom LLMs and server URLs (aka webhooks) using OAuth2 (RFC 6749). Use the `authenticationSession` dictionary, which contains an `accessToken` and an `expiresAt` datetime, to authenticate further requests to your custom LLM or server URL.
For example, create a webhook credential with `CreateCustomLLMCredentialDTO` with the following payload:
This returns a `CustomLLMCredential` object as follows:
This can be used to authenticate successive requests to your custom LLM or server URL.
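The sketch below shows one possible way to use an `authenticationSession` once you have it: check `expiresAt` before reuse and attach `accessToken` to outgoing requests. It assumes `expiresAt` is an ISO 8601 string with a timezone and that the token is sent as a bearer token in the `Authorization` header (the usual OAuth2 convention from RFC 6750); neither detail is confirmed by this changelog, and the helper names and the 30-second leeway are hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, Optional


def parse_expires_at(value: str) -> datetime:
    """Parse an ISO 8601 timestamp; a trailing 'Z' is treated as UTC (assumed format)."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def session_is_valid(
    session: Dict[str, str],
    now: Optional[datetime] = None,
    leeway: timedelta = timedelta(seconds=30),
) -> bool:
    """Return True if the session's token is still usable, allowing a small safety margin."""
    now = now or datetime.now(timezone.utc)
    return parse_expires_at(session["expiresAt"]) - leeway > now


def bearer_headers(session: Dict[str, str]) -> Dict[str, str]:
    """Build request headers carrying the access token as a bearer token (assumption)."""
    return {"Authorization": f"Bearer {session['accessToken']}"}
```

When `session_is_valid` returns `False`, request a fresh token before calling your custom LLM or server URL again.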
- OAuth2 Support for Custom LLM Credentials and Webhooks: You can now authorize access to your custom LLMs and server URLs (aka webhooks) using OAuth2 (RFC 6749).
For example, create a webhook credential with `CreateWebhookCredentialDTO` with the following payload:
This returns a `WebhookCredential` object as follows:
- Removal of Canonical Knowledge Base: The ability to create, update, and use canonical knowledge bases in your assistant has been removed from the API (custom knowledge bases and the Trieve integration support a superset of this functionality). Please update your implementations, as endpoints and models referencing canonical knowledge base schemas are no longer available.
- New xAI and Inflection AI models: You can now set `Assistant.model` to use `XAI` (e.g., model `grok-beta`) or `Inflection AI` (e.g., model `inflection_3_pi`). Specify these providers in `assistant.model`, `call.squad.members.assistant.model`, or `call.squad.members.assistantOverrides.model`.
- Integrate Existing Trieve Vector Stores in Your Knowledge Base: When you create a knowledge base with `POST /knowledge-base`, you can now specify `vectorStoreProviderId` to use an existing vector store from your Trieve account.
- Create Vector Stores with Uploaded Files: You can first upload files using the `POST /files` endpoint, and then use `fileIds` to specify the IDs of previously uploaded files to create a new Trieve vector store. You can customize how your files are ingested, chunked, then rebalanced to ensure correct knowledge is served by your assistant:
  - Split delimiters: Specify `splitDelimiters` to control how files are split before chunking (default is `[.!?\n]`).
  - Splits per chunk: Set `targetSplitsPerChunk` to specify the desired number of splits per chunk when creating a vector store (default is 20 splits per chunk).
  - Chunk rebalancing: Set `rebalanceChunks` to `true` to evenly distribute remainder splits across chunks when creating a vector store to ensure balanced chunk sizes; for example, 66 splits with `targetSplitsPerChunk` of 20 will result in 3 chunks with 22 splits each.
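To make the chunking options above concrete, here is a small sketch of splitting and rebalancing. It is not the actual ingestion code: the functions `split_text` and `chunk_sizes` are hypothetical, and the behavior without rebalancing (remainder splits placed in a smaller trailing chunk) is an assumption. The rebalanced case is written to match the 66-split example in the entry.

```python
import re
from typing import List


def split_text(text: str, delimiters: str = r"[.!?\n]") -> List[str]:
    """Split text on the delimiter character class, dropping empty pieces."""
    return [part.strip() for part in re.split(delimiters, text) if part.strip()]


def chunk_sizes(total_splits: int, target: int = 20, rebalance: bool = False) -> List[int]:
    """Return the number of splits placed in each chunk."""
    if total_splits <= 0:
        return []
    full, remainder = divmod(total_splits, target)
    if not rebalance:
        # Assumed behavior: leftover splits form one smaller trailing chunk.
        return [target] * full + ([remainder] if remainder else [])
    # Rebalanced: keep the number of full chunks and spread the remainder across them.
    n_chunks = max(1, full)
    base, extra = divmod(total_splits, n_chunks)
    return [base + 1] * extra + [base] * (n_chunks - extra)


print(chunk_sizes(66, target=20, rebalance=True))  # [22, 22, 22]
```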
- Customize Search Heuristics: You can filter or remove search results from your knowledge base:
  - Filter by score threshold: Set `scoreThreshold` to filter out chunks during searches based on their score. For cosine similarity, chunks below the threshold are filtered out; for other distance metrics, chunks above the threshold are filtered.
  - Remove stop words: Set `removeStopWords` to `true` to remove stop words during searches. The stop words list is specified in `server/src/stop-words.txt`, and queries that are entirely stop words will still be preserved.
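The two search heuristics above can be pictured with the following sketch. It only mirrors the rules as stated in the entry and is not the service's implementation; the function names, the `(text, score)` tuple shape, the `metric` labels, and the sample stop-word set are all hypothetical.

```python
from typing import List, Set, Tuple

Chunk = Tuple[str, float]  # (chunk text, score); hypothetical shape


def filter_by_score(chunks: List[Chunk], threshold: float, metric: str = "cosine") -> List[Chunk]:
    """Keep chunks that pass the threshold for the given metric."""
    if metric == "cosine":
        # Cosine similarity: higher is better, so drop chunks below the threshold.
        return [(text, score) for text, score in chunks if score >= threshold]
    # Other distance metrics: lower is better, so drop chunks above the threshold.
    return [(text, score) for text, score in chunks if score <= threshold]


def remove_stop_words(query: str, stop_words: Set[str]) -> str:
    """Strip stop words from a query, keeping the query intact if nothing else remains."""
    kept = [word for word in query.split() if word.lower() not in stop_words]
    return " ".join(kept) if kept else query


stop_words = {"what", "is", "the"}  # illustrative subset, not the real list
print(remove_stop_words("what is the refund policy", stop_words))  # refund policy
print(remove_stop_words("what is the", stop_words))  # what is the
```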
- Updated Analytics Endpoint: The `/analytics` endpoint has changed: use `GET /analytics` to retrieve analytics data instead of `POST /analytics`.
- Extended Silence Timeout for Assistants: You can now set `silenceTimeoutSeconds` up to 3600 seconds (previously 600 seconds) when creating or updating assistants and assistant overrides. This allows for longer periods of silence before an assistant session times out.
- New Credits Purchase Option: You can now purchase credits for your subscription by navigating to the updated billing page. Specify the dollar amount of your credits in the `credits` field to complete the purchase.