Rate Limits
The Voicy API enforces rate limits to ensure fair usage and protect service stability.
Limits by Plan
| Plan | Requests/minute | Concurrent calls | Max call duration |
|---|---|---|---|
| Standard | 60 | 10 | 30 minutes |
| Enterprise | 600 | 100 | 60 minutes |
Rate Limit Headers
Rate limit information is included in response headers:
| Header | Description |
|---|---|
| X-RateLimit-Limit | Maximum requests per minute |
| X-RateLimit-Remaining | Requests remaining in current window |
| X-RateLimit-Reset | Unix timestamp when the limit resets |
Handling Rate Limits
When you exceed the rate limit, you’ll receive a 429 Too Many Requests response:
Best Practices
Implement exponential backoff
When you receive a 429 response, wait before retrying:
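A minimal sketch of this retry pattern follows. It is not an official Voicy client: `call_with_backoff`, `backoff_delay`, and the `send` callable are hypothetical names, and the 1-second base delay, 60-second cap, and retry count are assumed values rather than documented ones. A response is modeled as a `(status_code, headers)` tuple so the sketch does not depend on any particular HTTP library.

```python
import random
import time
from typing import Callable, Tuple

# Assumption: a response is modeled as (status_code, headers) to keep the
# sketch independent of any specific HTTP library.
Response = Tuple[int, dict]


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0) -> float:
    """Delay before retry number `attempt` (0-based): base * 2**attempt, capped."""
    return min(cap, base * (2 ** attempt))


def call_with_backoff(
    send: Callable[[], Response],
    max_retries: int = 5,
    sleep: Callable[[float], None] = time.sleep,
    jitter: bool = True,
) -> Response:
    """Call `send`, retrying on 429 with exponentially growing waits.

    Returns the first non-429 response, or the last 429 response once
    `max_retries` retries have been used up.
    """
    response = send()
    for attempt in range(max_retries):
        if response[0] != 429:
            return response
        delay = backoff_delay(attempt)
        if jitter:
            # Randomize the wait so many clients do not retry in lockstep.
            delay = random.uniform(0, delay)
        sleep(delay)
        response = send()
    return response
```

With jitter disabled, the waits are 1, 2, 4, 8, ... seconds up to the cap. In practice `send` would wrap whatever HTTP call your application makes to the API.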
Check rate limit headers
Monitor your usage by checking the response headers:
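One way to do this is sketched below, using the three headers listed in the table above. `RateLimitStatus`, `parse_rate_limit`, and `seconds_to_wait` are hypothetical helper names, and the sketch assumes the header values arrive as decimal strings in a mapping keyed by the exact header names shown.

```python
import time
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RateLimitStatus:
    limit: int      # X-RateLimit-Limit
    remaining: int  # X-RateLimit-Remaining
    reset_at: int   # X-RateLimit-Reset (Unix timestamp)


def parse_rate_limit(headers: Mapping[str, str]) -> RateLimitStatus:
    """Read the rate limit headers from a response.

    Assumption: `headers` uses the exact casing shown here. Many HTTP
    libraries expose a case-insensitive mapping, which also works.
    """
    return RateLimitStatus(
        limit=int(headers["X-RateLimit-Limit"]),
        remaining=int(headers["X-RateLimit-Remaining"]),
        reset_at=int(headers["X-RateLimit-Reset"]),
    )


def seconds_to_wait(status: RateLimitStatus, now: Optional[float] = None) -> float:
    """Return 0 if requests remain, else the seconds until the window resets."""
    if status.remaining > 0:
        return 0.0
    current = time.time() if now is None else now
    return float(max(0.0, status.reset_at - current))
```

Calling `seconds_to_wait(parse_rate_limit(response_headers))` after each response lets a client pause before it runs into a 429, instead of reacting only after one arrives.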
Queue requests
For bulk operations, implement a request queue:
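A simple client-side queue could look like the sketch below. `RequestQueue` is a hypothetical class, not part of any Voicy SDK. It spaces jobs evenly (one every `60 / per_minute` seconds), which is one conservative way to stay under a per-minute limit. The clock and sleep functions are injectable so the pacing can be tested without real waiting.

```python
import time
from collections import deque
from typing import Any, Callable, Deque, List


class RequestQueue:
    """Run queued jobs no faster than `per_minute` jobs per minute."""

    def __init__(
        self,
        per_minute: int,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.interval = 60.0 / per_minute  # minimum spacing between jobs
        self._jobs: Deque[Callable[[], Any]] = deque()
        self._clock = clock
        self._sleep = sleep

    def add(self, job: Callable[[], Any]) -> None:
        """Queue a zero-argument callable that performs one API request."""
        self._jobs.append(job)

    def drain(self) -> List[Any]:
        """Run all queued jobs in order, pausing between them as needed."""
        results: List[Any] = []
        next_slot = self._clock()
        while self._jobs:
            wait = next_slot - self._clock()
            if wait > 0:
                self._sleep(wait)
            job = self._jobs.popleft()
            results.append(job())
            # Schedule the next job one interval after this one started,
            # or after now if the job itself took longer than the interval.
            next_slot = max(next_slot, self._clock() - self.interval) + self.interval
        return results
```

For the Standard plan in the table above, `RequestQueue(per_minute=60)` would send at most one request per second. Even spacing trades some throughput for simplicity; a token bucket would allow short bursts while keeping the same average rate.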
Concurrent Call Limits
The concurrent call limit applies to active calls (calls in initiated, ringing, or in_progress status).
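As an illustration, a client can check its own active-call count before creating a new call. This is a sketch under assumptions: it supposes each call is available as a mapping with a `status` field holding one of the status names above, which is a hypothetical shape and not a documented response format. `count_active_calls` and `can_start_call` are illustrative helper names.

```python
from typing import Iterable, Mapping

# Statuses that count toward the concurrent call limit, per the text above.
ACTIVE_STATUSES = frozenset({"initiated", "ringing", "in_progress"})


def count_active_calls(calls: Iterable[Mapping[str, str]]) -> int:
    """Count calls whose status counts toward the concurrent limit."""
    return sum(1 for call in calls if call.get("status") in ACTIVE_STATUSES)


def can_start_call(calls: Iterable[Mapping[str, str]], concurrent_limit: int) -> bool:
    """Return True if one more call fits under `concurrent_limit`."""
    return count_active_calls(calls) < concurrent_limit
```

For example, with the Standard plan's limit of 10, `can_start_call(calls, 10)` returns `False` once ten calls are in an active status.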
If you try to create a call when at the limit: