Container Images Release
Deepgram Self-Hosted February 2026 Release (260212)
Container Images (release 260212)
-
quay.io/deepgram/self-hosted-api:release-260212- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.177.3
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260212-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.107.0-1
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260212- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.9.2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260212- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.12.1
- Equivalent image to:
This Release Contains The Following Changes
- General Improvements — Keeps our software up-to-date.
New Default Concurrency Limits
We’re increasing default concurrency limits by up to 3X for Streaming Speech to Text, Text to Speech, and Voice Agent for Pay as you Go, Growth, and Enterprise plans.
For full details on the rate limits for your plan, see the API Rate Limits documentation.
🤖 New OpenAI & Gemini LLM Models Support
We’ve added support for new LLM models in our Voice Agent API!
Available Models:
- OpenAI GPT 5.2 Instant (gpt-5.2-instant)
- OpenAI GPT 5.2 Thinking (gpt-5.2)
- Google Gemini 3 Flash (gemini-3-flash-preview)
Implementation: Configure your chosen model in your Voice Agent settings:
For complete information about supported LLMs including the new models, visit our Voice Agent LLM Models documentation.
Nova-3 Multilingual Model Update
🌍 Nova-3 Multilingual Improvements
We’ve released an updated Nova-3 multilingual model, delivering accuracy improvements across supported languages, with the largest gains in code-switching scenarios.
This update focuses on improving real-world multilingual speech recognition, especially for inputs that mix languages within a single utterance or conversation.
Key improvements include:
- Lower Word Error Rate (WER) across both batch and streaming inference for all languages supported by the multilingual model
- Significantly improved code-switching handling, reducing word drops when languages are mixed
These improvements help developers build more reliable, natural multilingual voice experiences without changing APIs or configuration.
Learn more about Nova-3 Multilingual on the Models and Language Overview page.
Nova-3 Model Update
🌐 Nova-3 Adds Support for Hebrew, Farsi, and Urdu
We’re excited to announce the release of new Nova-3 monolingual models for Hebrew, Farsi, and Urdu! These additions bring industry-leading speech-to-text capabilities for users of these languages.
Nova-3 now supports the following new languages and language codes:
- Hebrew (
he) - Persian (Farsi) (
fa) - Urdu (
ur)
This release empowers developers and businesses to build more inclusive voice experiences for communities speaking Hebrew, Farsi, and Urdu across the globe.
Learn more about Nova-3 on the Models and Language Overview page.
Container Images Release
Deepgram Self-Hosted January 2026 Release (260129)
Container Images (release 260129)
-
quay.io/deepgram/self-hosted-api:release-260129- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.177.3
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260129-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.107.0-1
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260129- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.9.2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260129- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.12.1
- Equivalent image to:
This Release Contains The Following Changes
- Nova-3 Supports 12 New Languages — Belarusian (be), Bengali (bn), Bosnian (bs), Croatian (hr), Kannada (kn), Macedonian (mk), Marathi (mr), Serbian (sr), Slovenian (sl), Tamil (ta), Tagalog (tl), and Telugu (te). Note that Arabic support is not included in this release. See the full announcement for details.
- General Improvements — Keeps our software up-to-date.
Nova-3 Model Update
🌐 Nova-3 Now Supports Arabic and all major Arabic dialects
We’re excited to announce the release of the Nova-3 Arabic monolingual model, which now brings industry-leading speech-to-text support for Arabic and its major dialects!
Nova-3 Arabic supports the following dialects and language codes:
- Arabic (General) (
ar) - Dialects supported:
- United Arab Emirates Arabic (
ar-AE) - Saudi Arabian Arabic (
ar-SA) - Qatari Arabic (
ar-QA) - Kuwaiti Arabic (
ar-KW) - Syrian Arabic (
ar-SY) - Lebanese Arabic (
ar-LB) - Palestinian Arabic (
ar-PS) - Jordanian Arabic (
ar-JO) - Egyptian Arabic (
ar-EG) - Sudanese Arabic (
ar-SD) - Chadian Arabic (
ar-TD) - Moroccan Arabic (
ar-MA) - Algerian Arabic (
ar-DZ) - Tunisian Arabic (
ar-TN) - Iraqi Arabic (
ar-IQ) - Iranian Arabic (
ar-IR)
- United Arab Emirates Arabic (
This expansion empowers developers and businesses to build more inclusive, regionally-tailored voice applications for Arabic-speaking users across the globe.
Learn more about Nova-3 on the Models and Language Overview page.
Nova-3 Model Update
🌐 Nova-3 supports 12 new languages
We’re pleased to announce the addition of 12 new languages for Nova-3 monolingual models. This expansion makes Nova-3 even more versatile and accessible for global users. The newly supported languages and their corresponding language codes are:
Newly Supported:
- Belarusian (
be) - Bengali (
bn) - Bosnian (
bs) - Croatian (
hr) - Kannada (
kn) - Macedonian (
mk) - Marathi (
mr) - Serbian (
sr) - Slovenian (
sl) - Tamil (
ta) - Tagalog (
tl) - Telugu (
te)
Learn more about Nova-3 on the Models and Language Overview page.
Multiple LLM Provider Support
We’ve added new functionality that allows users to specify multiple LLM providers for your Voice Agent, ensuring your agent will automatically fallback to another provider should you experience any issues. The think object supports both a single provider and an array of providers. LLM providers will be used in the order that you specify them.
For more details, visit our Voice Agent Multiple LLM Models documentation
🤖 New LLM Models Support
We’ve added support for new LLM models in our Voice Agent API!
Available Models:
- OpenAI GPT 5.1 Chat (gpt-5.1-chat-latest)
- OpenAI GPT 5.1 (gpt-5.1)
- Anthropic Claude Sonnet 4.5 (claude-sonnet-4-5
- Google Gemini 3 (gemini-3-pro-preview)
Implementation: Configure your chosen model in your Voice Agent settings:
For complete information about supported LLMs including the new models, visit our Voice Agent LLM Models documentation.
Container Images Release
Deepgram Self-Hosted January 2026 Release (260115)
Container Images (release 260115)
-
quay.io/deepgram/self-hosted-api:release-260115- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.176.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260115-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.107.0-1
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260115- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.9.2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260115- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.12.1
- Equivalent image to:
January 2026 Self-Hosted Release: Update Recommendation
In Deepgram’s January 2026 self-hosted release (release-260115), we added new functionality to improve TTS response times from our API and Engine containers.
Due to this product change, the January 2026 self-hosted release is not backwards-compatible with previous releases when used to serve TTS traffic. It is a breaking change in how the API and Engine containers communicate with each other. To avoid any downtime in your self-hosted deployment, the updated version of the Engine node (3.107.0-1) must be running in advance of the updated version of the API node (1.176.0) serving requests. Note that the new version of the Engine (3.107.0-1) is compatible with previous versions of the API, so the Engine container must be deployed before the API container. The blue-green deployment strategy is one possible deployment strategy, but there are others that satisfy the requirement that the Engine container is deployed first. This is only applicable for deployments serving TTS traffic. The breaking change is not relevant to deployments serving STT traffic.
The License Proxy node is not impacted by breaking changes, but in the context of a complete Deepgram self-hosted deployment, it is most cohesive to also include the update to the License Proxy node (1.9.2) in the blue-green deployment.
This Release Contains The Following Changes
- Improves Transcription of “Um” in Portuguese — Monolingual Portuguese STT now transcribes “um” (meaning “one”) as a non-filler word, and “um” is included in Portuguese transcripts, even when the
filler_wordsfeature is disabled. - General Improvements — Keeps our software up-to-date