Understand the different service limits of Deepgram's APIs.

The error 429: Too Many Requests is returned when your project has more concurrent requests than the rate limits allow.

Pay as You Go / Growth

Limits to consider if you use the Pay as You Go or Growth plans with Deepgram.

Speech to Text

ModelService Limit
Nova-2 Pre-RecordedUp to 100 concurrent requests
Streaming Up to 100 concurrent requests
Nova Pre-RecordedUp to 100 concurrent requests
Streaming Up to 100 concurrent requests
EnhancedPre-RecordedUp to 100 concurrent requests
Streaming Up to 100 concurrent requests
BasePre-RecordedUp to 100 concurrent requests
Streaming Up to 100 concurrent requests
Whisper CloudPre-Recorded Up to 5 concurrent requests

Audio Intelligence

If you include Audio Intelligence features in requests to /listen, you will be subject to the service limits noted in the table below.

ModelService Limit
Intent RecognitionUp to 10 concurrent requests
Entity DetectionUp to 5 concurrent requests
Sentiment AnalysisUp to 10 concurrent requests
SummarizationUp to 10 concurrent requests
Topic DetectionUp to 10 concurrent requests

Text Intelligence

ModelService Limit
Intent RecognitionUp to 10 concurrent requests
Sentiment AnalysisUp to 10 concurrent requests
SummarizationUp to 10 concurrent requests
Topic DetectionUp to 10 concurrent requests

Text to Speech

ModelService LimitFeedback
AuraUp to 480 requests / minShare your feedback on our TTS rate limits

Enterprise

Starting limits to consider if you have an Enterprise Contract with Deepgram.

📘

New and existing Enterprise customers can request a Service Limit increase by discussing your needs with the Deepgram Sales Team.

Speech to Text

ModelService Limit
Nova-2 Pre-RecordedStarting at 100 concurrent requests
Streaming Starting at 100 concurrent requests
Nova Pre-RecordedStarting at 100 concurrent requests
Streaming Starting at 100 concurrent requests
EnhancedPre-RecordedStarting at 100 concurrent requests
Streaming Starting at 100 concurrent requests
BasePre-RecordedStarting at 100 concurrent requests
Streaming Starting at 100 concurrent requests
Whisper CloudPre-Recorded Starting at 15 concurrent requests

Audio Intelligence

If you include Audio Intelligence features in requests to /listen, you will be subject to the service limits noted in the table below.

ModelService Limit
Intent RecognitionStarting at 10 concurrent requests
Entity DetectionStarting at 10 concurrent requests
Sentiment AnalysisStarting at 10 concurrent requests
SummarizationStarting at 10 concurrent requests
Topic DetectionStarting at 10 concurrent requests

Text Intelligence

ModelService Limit
Intent RecognitionStarting at 10 concurrent requests
Sentiment AnalysisStarting at 10 concurrent requests
SummarizationStarting at 10 concurrent requests
Topic DetectionStarting at 10 concurrent requests

Text to Speech

ModelService LimitFeedback
AuraStarting at 2400 requests / minShare your feedback on our TTS rate limits