Understanding Call Concurrency
Overview
Call concurrency represents how many Vapi calls can be active at the same time. Each call occupies one slot, similar to using a finite set of phone lines.
In this guide, you’ll learn to:
- Understand the default concurrency allocation and when it is usually sufficient
- Keep outbound and inbound workloads within plan limits
- Increase reserved capacity directly from the Vapi Dashboard
- Inspect concurrency data through API responses and analytics queries
What is concurrency?
Every Vapi account includes 10 concurrent call slots by default. When all slots are busy, new outbound dials or inbound connections wait until a slot becomes free.
Rarely hit concurrency caps unless traffic surges (launches, seasonal spikes).
More likely to reach limits when running large calling batches.
These limits ensure the underlying compute stays reliable for every customer. Higher concurrency requires reserving additional capacity, which Vapi provides through custom or add-on plans.
Managing concurrency
Outbound campaigns
Batch long lead lists into smaller chunks (for example, 50–100 numbers) and run those batches sequentially. This keeps your peak concurrent calls near the default limit while still working through large sets quickly.
High-volume operations
If you regularly exceed 50,000 minutes per month, talk with Vapi about:
- Custom plans that include higher baked-in concurrency
- Add-on bundles that let you purchase extra call lines only when you need them
Use billing reports to pair minute usage with concurrency spikes so you can upgrade before calls are blocked.
Increase your concurrency limit
You can raise or reserve more call lines without contacting support:
- Open the Vapi Dashboard.
- Navigate to Settings → Billing.
- Find Reserved Concurrency (Call Lines).
- Increase the limit or purchase add-on concurrency lines.
Changes apply immediately, so you can scale ahead of known traffic surges.
View concurrency in call responses
When you create a call with POST /call, the response includes a subscriptionLimits object that shows the current state of your account.
Example request
Example response snippet
Field reference
concurrencyBlocked—trueif the call could not start because all slots were full.concurrencyLimit— Total concurrent call slots currently available to your org.remainingConcurrentCalls— How many slots were open at the time you created the call.
Build monitoring around these values to alert when you approach the cap.
Track concurrency with the Analytics API
Use the /analytics endpoint to review historical concurrency usage and spot patterns that justify more capacity.
Example request
Example response
Adjust the timeRange.step to inspect usage by hour, day, or week. Peaks that align with campaign launches, seasonality, or support events highlight when you should reserve additional call lines.
Next steps
- Call queue management: Build a Twilio queue to buffer calls when you hit concurrency caps.
- Outbound campaign planning: Design outbound strategies that pair batching with analytics.
- Enterprise plans: Review larger plans that include higher default concurrency.