Background speech denoising helps create clearer conversations by filtering out unwanted sounds while users speak. Vapi offers two complementary denoising technologies that can be used independently or together for optimal results.
In this guide, you’ll learn to:
For most use cases, Smart Denoising alone provides excellent results. Fourier denoising is a highly experimental feature that requires significant tuning and may not work well in all environments.
Smart Denoising uses Krisp’s AI-powered technology to remove background noise in real-time. This method is highly effective for common noise sources like:
Fourier denoising uses frequency-domain filtering to remove consistent background noise. This experimental method offers fine-grained control through multiple parameters and includes automatic media detection for TV/music/radio backgrounds.
Fourier denoising is highly experimental and comes with significant limitations:
For most users, Smart Denoising should be sufficient. Only proceed with Fourier denoising if you have specific requirements and are prepared to test extensively.
Background speech denoising is configured through the backgroundSpeechDenoisingPlan property on your assistant:
Smart Denoising has a simple on/off configuration:
Enable or disable Krisp-powered smart denoising
Fourier denoising offers multiple parameters for fine-tuning:
Enable or disable experimental Fourier denoising
Automatically detect and filter consistent background media (TV/music/radio)
Fallback threshold in dB when no baseline is established (-80 to 0)
How far below the rolling baseline to filter audio, in dB (-30 to -5)
Rolling window size in milliseconds for baseline calculation (1000 to 30000)
Percentile for baseline calculation (1 to 99)
For maximum noise reduction, combine both methods. Processing order:
Minimal speech denoising for clear environments:
Aggressive filtering for high-noise environments:
Optimized for media background noise:
For most users, Smart Denoising alone is the recommended solution. It handles the vast majority of common noise scenarios effectively without configuration complexity. Only consider adding Fourier denoising if you have specific requirements that Smart Denoising cannot address.
Smart Denoising only:
Smart Denoising + Fourier Denoising:
Fourier Denoising should never be used alone. It’s designed to complement Smart Denoising by providing additional filtering after Krisp has done the initial noise reduction.
Audio quality: Aggressive filtering may affect voice quality. Test different settings to find the right balance between noise reduction and natural speech preservation.
Reduce filtering aggressiveness:
baselineOffsetDb (e.g., -20 instead of -15)baselinePercentile (e.g., 75 instead of 85)Increase filtering:
baselineOffsetDb (e.g., -12 instead of -15)mediaDetectionEnabled is true for TV/musicAdjust detection sensitivity:
windowSizeMs for more stabilitystaticThreshold if baseline isn’t establishing