Voice fallback configuration gives you the ability to continue your call in the event that your primary voice fails. Your assistant will sequentially fallback to only the voices you configure within your plan, in the exact order you specify.
Without a fallback plan configured, your call will end with an error in the event that your chosen voice provider fails.
When a voice failure occurs, Vapi will:
Scroll down to find the Fallback Voices collapsible section. A warning indicator appears if no fallback voices are configured.
Click Add Fallback Voice to configure your first fallback:
Add the fallbackPlan property to your assistant’s voice configuration, and specify the fallback voices within the voices property.
Fallback voices must be valid JSON configurations, not strings. The order matters—Vapi will choose fallback voices starting from the beginning of the list.
Each voice provider supports different configuration options. Expand the accordion below to see available settings for each provider.
eleven_multilingual_v2, eleven_turbo_v2, eleven_turbo_v2_5, eleven_flash_v2, eleven_flash_v2_5, or eleven_monolingual_v1.sonic-english, sonic-3, etc.).["happiness:high", "curiosity:medium"]).tts-1, tts-1-hd, or realtime models.tts-1 or tts-1-hd models.auto for auto-detection.arcana, mistv2, or mist. Defaults to arcana.<200> for 200ms pause).{h'El.o}).female_happy, male_sad, female_angry, male_surprised).PlayHT2.0, PlayHT2.0-turbo, Play3.0-mini, or PlayDialog.aura or aura-2. Defaults to aura-2.octave2).speech-02-hd (high-fidelity) or speech-02-turbo (low latency). Defaults to speech-02-turbo.happy, sad, angry, fearful, surprised, disgusted, neutral).neu_fast).lightning).There is no change to the pricing of the voices. Your call will not incur any extra fees while using fallback voices, and you will be able to see the cost for each voice in your end-of-call report.
You can configure as many fallback voices as you need. However, we recommend 2-3 fallbacks from different providers for optimal reliability.
Users may notice a brief pause and a change in voice characteristics when switching to a fallback voice. Selecting voices with similar properties helps minimize this disruption.