Deepgram Self-Hosted April 2026 Release (260402)
Container Images (release 260402)
-
quay.io/deepgram/self-hosted-api:release-260402- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.181.3
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260402-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.114.5
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260402- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.10.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260402- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.13.0
- Equivalent image to:
This Release Contains The Following Changes
- Certificate Endpoint Fix — Engine now responds to
/v1/certificatesin addition to/certificates, consistent with the other container images. See Certificate Status for details. - Model Name Consistency — The
/v1/modelsendpoint now returns acanonical_namefield matching the model name used in/v1/listenrequests. - General Improvements — Keeps our software up-to-date.
Nova-3 Model Update
🌏 Nova-3 now supports the following new languages and language codes:
- Chinese (Mandarin, Simplified):
zh,zh-CN,zh-Hans - Chinese (Mandarin, Traditional):
zh-TW,zh-Hant
Access these models by setting model="nova-3" and the relevant language code in your request.
Learn more about Nova-3 and supported languages on the Models and Language Overview page.
TTS speed controls & updated LLM models
TTS speak speed (Early Access)
You can now control the speaking rate of Deepgram TTS in the Voice Agent API using the agent.speak.provider.speed parameter. This parameter accepts a float value between 0.7 and 1.5, with 1.0 as the default.
This feature is in Early Access and is only available for Deepgram TTS. For more details, see TTS voice controls. To request access, contact your Account Executive or reach out to sales@deepgram.com.
Updated LLM models
New OpenAI models — Two new models are now available in the Standard pricing tier:
gpt-5.4-nanogpt-5.4-mini
Gemini 2.0 Flash deprecated — The gemini-2.0-flash model is now deprecated. We recommend migrating to gemini-2.5-flash or a newer Gemini model. See the Google models table for alternatives.
For the full list of supported models and pricing tiers, see the Voice Agent LLM Models documentation.
Deepgram Self-Hosted March 2026 Release (260319)
Container Images (release 260319)
-
quay.io/deepgram/self-hosted-api:release-260319- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.180.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260319-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.114.4
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260319- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.10.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260319- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.13.0
- Equivalent image to:
This Release Contains The Following Changes
- Flux Regression Fix — Resolves Flux support regression from the 260305 release. See Deploy Flux Model (STT) for deployment details.
- Nova-3 Language Expansion — New models: Thai (
th,th-TH), Chinese Cantonese Traditional (zh-HK). Improved models: Bengali (bn), Marathi (mr), Tamil (ta), Telugu (te). See the full announcement for details. - Flux Status Metrics — Self-hosted status endpoint now includes Flux stream metrics. See Status Endpoint for details.
- Certificate Status Endpoint — New
/v1/certificatesendpoint on all container images returns beginning-of-support, end-of-support, and end-of-life dates. See Certificate Status for details. - Log Formats — New configurable log output formats: Full, Compact, Pretty, Json. See Log Formats for configuration details.
- General Improvements — Keeps our software up-to-date.
Nova-3 Model Update
🌏 Nova-3 now supports the following new languages and language codes:
- Chinese (Cantonese, Traditional):
zh-HK - Thai:
th,th-TH
🚀 Also releasing improved Nova-3 models for the following languages:
- Bengali (
bn) - Marathi (
mr) - Tamil (
ta) - Telugu (
te)
Access these models by setting model="nova-3" and the relevant language code in your request.
Learn more about Nova-3 and supported languages on the Models and Language Overview page.
🤖 New LLM Models Support & Bug Fixes
We’ve added support for new LLM models in the Voice Agent API:
- OpenAI GPT-5.3 Instant (
gpt-5.3-chat-latest) - OpenAI GPT 5.4 (
gpt-5.4) - Google Gemini 3.1 Flash Lite (
gemini-3.1-flash-lite)
Example:
For the full list of supported models and pricing tiers, visit our Voice Agent LLM Models documentation.
Fixes
- Resolves an issue where the GPT-5.2 Instant model used an incorrect model ID and pricing tier. The model now uses the correct ID (
gpt-5.2-chat-latest) and is assigned to theAdvancedtier.
Nova-3 Model Update
🎯 Nova-3 Swedish and Dutch Model Enhancements
We’ve released updated Nova-3 Swedish and Nova-3 Dutch models, offering improved accuracy for both streaming and batch transcription.
Access these models by setting model: "nova-3" and the relevant language code:
- Swedish (
sv,sv-SE) - Dutch (
nl)
Learn more about Nova-3 on the Models and Language Overview page.
Reasoning mode for OpenAI thinking models
You can now control the reasoning effort of supported OpenAI reasoning models using the new reasoning_mode parameter in the think provider configuration. This parameter maps to OpenAI’s reasoning_effort and accepts low, medium, or high.
Example:
For more details, visit the Configure the Voice Agent documentation.
Model Improvement Program pricing update
Pay as you Go and Growth customers can now opt in or out of the Model Improvement Program with no impact on the rates listed on deepgram.com/pricing.
Deepgram Self-Hosted March 2026 Release (260305)
We are aware of an issue with Flux in this release. Do not use this release for Flux deployments. This is resolved in the 260319 release.
Container Images (release 260305)
-
quay.io/deepgram/self-hosted-api:release-260305- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.179.5
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260305-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.113.2
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260305- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.10.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260305- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.12.1
- Equivalent image to:
This Release Contains The Following Changes
- Nova-3 Right-to-Left Language Support — Nova-3 now supports Arabic, Hebrew, Farsi, and Urdu. See the full announcement for details.
- Nova-3 Multilingual Model Update — Accuracy improvements across all supported languages, with the largest gains in code-switching scenarios. See the full announcement for details.
- Abbreviated Dates in Smart Formatting — Smart formatting now recognizes and formats abbreviated dates.
- General Improvements — Keeps our software up-to-date.