API Rate Limits | Deepgram's Docs

Pay as You Go / Growth

Limits to consider if you use the Pay as You Go or Growth plans with Deepgram.

API	Connection Limits
Voice Agent API	Up to 5 concurrent connections

If multiple services are used in one API call (e.g., Speech to Text + Sentiment Analysis), the lower of the rate limits applies.

Model	Pre-Recorded	Streaming
Nova-3	Up to 100 concurrent requests	Up to 50 concurrent requests
Nova-2	Up to 100 concurrent requests	Up to 50 concurrent requests
Nova	Up to 100 concurrent requests	Up to 50 concurrent requests
Enhanced	Up to 100 concurrent requests	Up to 50 concurrent requests
Base	Up to 100 concurrent requests	Up to 50 concurrent requests
Whisper Cloud	Up to 5 concurrent requests	Not listed
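One way to stay under a concurrency limit like those in the table above is to cap in-flight requests on the client side. The following is a minimal sketch, assuming an `asyncio`-based client; `run_with_limit` and `fake_transcribe` are hypothetical names for illustration, and `fake_transcribe` is a stand-in that makes no real API call.

```python
import asyncio

# Limit taken from the table above: Whisper Cloud pre-recorded requests
# on the Pay as You Go / Growth plans.
MAX_CONCURRENT = 5


async def run_with_limit(jobs, worker, max_concurrent=MAX_CONCURRENT):
    """Run worker(job) for every job with at most max_concurrent in flight."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(job):
        # Each job waits here until one of the max_concurrent slots is free.
        async with semaphore:
            return await worker(job)

    # gather() returns results in the same order as the input jobs.
    return await asyncio.gather(*(guarded(job) for job in jobs))


async def fake_transcribe(path):
    # Hypothetical placeholder for a real request; swap in your client code.
    await asyncio.sleep(0.01)
    return f"transcript of {path}"


if __name__ == "__main__":
    files = [f"audio_{i}.wav" for i in range(12)]
    results = asyncio.run(run_with_limit(files, fake_transcribe))
    print(len(results))  # 12
```

A semaphore caps only the requests issued by this one process. If several workers share an API key, each needs a share of the limit, or the cap has to be coordinated centrally.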

Model	Service Limit
Aura	Pay as You Go: Up to 5 concurrent requests
Aura	Growth: Up to 5 concurrent requests
Aura-2	Pay as You Go: Up to 5 concurrent requests
Aura-2	Growth: Up to 5 concurrent requests

Model	Service Limit
Aura	Pay as You Go: Up to 5 concurrent requests
Aura	Growth: Up to 5 concurrent requests
Aura-2	Pay as You Go: Up to 5 concurrent requests
Aura-2	Growth: Up to 5 concurrent requests

If you include Audio Intelligence features in requests to /listen, you will be subject to the service limits noted in the table below.

Enterprise

Starting limits to consider if you have an Enterprise contract with Deepgram.

New and existing Enterprise customers can request a Service Limit increase by discussing their needs with the Deepgram Sales Team.

API	Connection Limits
Voice Agent API	Starting at 50 concurrent connections

If multiple services are used in one API call (e.g., Speech to Text + Sentiment Analysis), the lower of the rate limits applies.

Model	Pre-Recorded	Streaming
Nova-3	Starting at 100 concurrent requests	Starting at 100 concurrent requests
Nova-2	Starting at 100 concurrent requests	Starting at 100 concurrent requests
Nova	Starting at 100 concurrent requests	Starting at 100 concurrent requests
Enhanced	Starting at 100 concurrent requests	Starting at 100 concurrent requests
Base	Starting at 100 concurrent requests	Starting at 100 concurrent requests
Whisper Cloud	Starting at 15 concurrent requests	Not listed
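To judge whether the starting limits above are enough before asking for an increase, a rough estimate of peak concurrency can help. The sketch below uses Little's law (average concurrency is roughly arrival rate times average request duration) and assumes steady traffic. The function name, the `headroom` factor, and the sample traffic figures are illustrative assumptions, not values from Deepgram.

```python
import math


def required_concurrency(requests_per_second, avg_seconds_per_request, headroom=1.25):
    """Estimate concurrent requests needed, padded by a safety factor."""
    # Little's law: mean requests in flight = arrival rate * mean duration.
    in_flight = requests_per_second * avg_seconds_per_request
    return math.ceil(in_flight * headroom)


if __name__ == "__main__":
    # Hypothetical workload: 4 requests per second, 30 seconds each, 50% headroom.
    needed = required_concurrency(4, 30, headroom=1.5)
    starting_limit = 100  # Enterprise starting limit for Nova-3 in the table above
    print(needed, needed > starting_limit)  # 180 True
```

In this hypothetical workload the estimate exceeds the starting limit, which is the kind of figure worth bringing to a conversation about a Service Limit increase.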

Model	Service Limit
Aura	Starting at 25 concurrent requests
Aura-2	Starting at 25 concurrent requests

Model	Service Limit
Aura	Starting at 25 concurrent requests
Aura-2	Starting at 25 concurrent requests

If you include Audio Intelligence features in requests to /listen, you will be subject to the service limits noted in the table below.

Model	Service Limit
Intent Recognition	Starting at 10 concurrent requests
Entity Detection	Starting at 10 concurrent requests
Sentiment Analysis	Starting at 10 concurrent requests
Summarization	Starting at 10 concurrent requests
Topic Detection	Starting at 10 concurrent requests
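The rule that the lower limit applies when one call combines several services can be expressed as a minimum over the limits involved. This is a small sketch using the Enterprise starting limits listed in this section. The dictionary keys are display names copied from the tables, not API parameter names, and `effective_limit` is a hypothetical helper.

```python
# Enterprise starting limits (concurrent requests) copied from the tables above.
PRE_RECORDED_LIMITS = {
    "Nova-3": 100,
    "Nova-2": 100,
    "Whisper Cloud": 15,
}
AUDIO_INTELLIGENCE_LIMITS = {
    "Intent Recognition": 10,
    "Entity Detection": 10,
    "Sentiment Analysis": 10,
    "Summarization": 10,
    "Topic Detection": 10,
}


def effective_limit(model, features=()):
    """Return the lowest limit among the model and any requested features."""
    limits = [PRE_RECORDED_LIMITS[model]]
    limits.extend(AUDIO_INTELLIGENCE_LIMITS[name] for name in features)
    return min(limits)


if __name__ == "__main__":
    print(effective_limit("Nova-3"))                          # 100
    print(effective_limit("Nova-3", ["Sentiment Analysis"]))  # 10
```

With these starting values, adding any Audio Intelligence feature to a pre-recorded request brings the effective limit down to 10 concurrent requests.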

Model	Service Limit
Intent Recognition	Starting at 10 concurrent requests
Sentiment Analysis	Starting at 10 concurrent requests
Summarization	Starting at 10 concurrent requests
Topic Detection	Starting at 10 concurrent requests