Container Images Release
Deepgram Self-Hosted August 2025 Release (250828)
Container Images (release 250828)
-
quay.io/deepgram/self-hosted-api:release-250828
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.156.1
quay.io/deepgram/onprem-api:release-250828
quay.io/deepgram/onprem-api:1.156.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250828
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.100.0
quay.io/deepgram/onprem-engine:release-250828
quay.io/deepgram/onprem-engine:3.100.0
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250828
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250828
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250828
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250828
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- GPU Required Configuration — New
[health]
section in engine.toml withgpu_required
configuration option allows Engine to fail on startup if no GPU is detected. While Engine can run without a GPU, production deployments require one for acceptable performance. Setgpu_required = true
to fail fast if no GPU is available, rather than running with severely degraded performance. Default: false. - General Improvements — Keeps our software up-to-date.
Important: This is the last release that will include onprem-*
image tags. The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. Starting with the next release in September 2025, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Deepgram Self-Hosted August 2025 Release (250814)
Container Images (release 250814)
-
quay.io/deepgram/self-hosted-api:release-250814
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.156.1
quay.io/deepgram/onprem-api:release-250814
quay.io/deepgram/onprem-api:1.156.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250814
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.97.2
quay.io/deepgram/onprem-engine:release-250814
quay.io/deepgram/onprem-engine:3.97.2
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250814
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250814
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250814
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250814
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Aura-2 Self-Hosted — Now fully supports Deepgram’s Aura-2 text-to-speech service. You can refer to the Deploy TTS Services documentation for more details.
- Automatic Engine Shutdown on GPU Errors — Deepgram Engine will now automatically stop if a GPU error is detected, preventing potential data corruption. You can disable this by setting the environment variable
IMPELLER_DISABLE_AUTOSTOP=true
. - Smarter ZIP Code Formatting — Now supports proper formatting for alphanumeric postal codes (e.g., UK, Canada).
- General Improvements — Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. Through August 2025, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Starting in September 2025, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Deepgram Self-Hosted July 2025 Release (250731)
Container Images (release 250731)
-
quay.io/deepgram/self-hosted-api:release-250731
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.154.1
quay.io/deepgram/onprem-api:release-250731
quay.io/deepgram/onprem-api:1.154.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250731
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.94.0
quay.io/deepgram/onprem-engine:release-250731
quay.io/deepgram/onprem-engine:3.94.0
-
Minimum required NVIDIA driver version:
>=550.163.01
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250731
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250731
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250731
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250731
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Adds
redact_usage
functionality to redact the values of keyterms and other URL parameters. This is now enabled by default, and may be toggled via theredact_usage
boolean feature flag in api.toml. See our redact usage documentation for more information. - Adds targeted support for CUDA 12.8. We recommend updating to the latest CUDA 12.8 release for optimal performance and stability. See our driver installation doc for guidance on updating to newer NVIDIA driver and CUDA toolkit versions.
- Returns a 400 Unknown Model error with reference to our error documentation when NER is misconfigured.
- Ensures presence of a word-level
language
tag on multilingual transcripts that apply smart formatting. - Improves smart formatting of dates, alphanumerics, and numbers with units.
- Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. Through August 2025, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Starting in September 2025, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Deepgram Self-Hosted July 2025 Release (250710)
Container Images (release 250710)
-
quay.io/deepgram/self-hosted-api:release-250710
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.151.8
quay.io/deepgram/onprem-api:release-250710
quay.io/deepgram/onprem-api:1.151.8
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250710
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.91.0
quay.io/deepgram/onprem-engine:release-250710
quay.io/deepgram/onprem-engine:3.91.0
-
Minimum required NVIDIA driver version:
>=550.163.01
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250710
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250710
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250710
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250710
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Adds more verbose logging in Voice Agent for failures in TTS, LLM, and function-calling.
- Improves redaction accuracy around entities and punctuation.
- Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. Through August 2025, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Starting in September 2025, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Container Images (release 250626)
-
quay.io/deepgram/self-hosted-api:release-250626
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.150.2
quay.io/deepgram/onprem-api:release-250626
quay.io/deepgram/onprem-api:1.150.2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250626
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.91.0
quay.io/deepgram/onprem-engine:release-250626
quay.io/deepgram/onprem-engine:3.91.0
-
Minimum required NVIDIA driver version:
>=550.163.01
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250626
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250626
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250626
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250626
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Improves smart formatting for emails, alphanumerics, quantities, and percentages.
- Expands language support for profanity filtering in German, Swiss German, Polish, Portuguese, Spanish, and Swedish.
- Resolves an edge case when handling certain corrupt audio and now returns an HTTP 400 error code.
- Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. For the next three months, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Subsequently, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Container Images (release 250610)
-
quay.io/deepgram/self-hosted-api:release-250610
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.146.1
quay.io/deepgram/onprem-api:release-250610
quay.io/deepgram/onprem-api:1.146.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250610
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.89.2
quay.io/deepgram/onprem-engine:release-250610
quay.io/deepgram/onprem-engine:3.89.2
-
Minimum required NVIDIA driver version:
>=550.163.01
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250610
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250610
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250610
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250610
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Adds full support for Voice Agent v1 API.
- Addresses an issue with the
engine_active_requests
metric for streaming STT auto-scaling. - Resolves an issue with year formatting for
smart_format.
- Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. For the next four months, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Subsequently, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Container Images (release 250505)
-
quay.io/deepgram/self-hosted-api:release-250505
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.142.1
quay.io/deepgram/onprem-api:release-250505
quay.io/deepgram/onprem-api:1.142.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250505
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.89.0
quay.io/deepgram/onprem-engine:release-250505
quay.io/deepgram/onprem-engine:3.89.0
-
Minimum required NVIDIA driver version:
>=550.163.01
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250505
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250505
quay.io/deepgram/onprem-license-proxy:1.8.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250505
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250505
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Extends numeral formatting for supported languages when using
detect_language=true
. - Improves formatting of dates.
- Resolves an issue with Whisper functionality.
- Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. For the next five months, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Subsequently, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Container Images (release 250331)
-
quay.io/deepgram/self-hosted-api:release-250331
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.141.0
quay.io/deepgram/onprem-api:release-250331
quay.io/deepgram/onprem-api:1.141.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250331
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.85.4
quay.io/deepgram/onprem-engine:release-250331
quay.io/deepgram/onprem-engine:3.85.4
-
Minimum required NVIDIA driver version:
>=530.30.02
-
Maximum required NVIDIA driver version:
<=561.00.00
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250331
- It is essential to upgrade the license proxy before upgrading the API or Engine for this release. Neglecting to upgrade the license proxy will cause a breaking change on deployment.
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250331
quay.io/deepgram/onprem-license-proxy:1.8.0
-
quay.io/deepgram/self-hosted-billing:release-250331
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.11.2
quay.io/deepgram/onprem-billing:release-250331
quay.io/deepgram/onprem-billing:1.11.2
- Equivalent image to:
This Release Contains The Following Changes
- Adds support for our new class of multilingual Nova-3 models.
- Use
model=nova-3&language=multi
. - Contact your Deepgram account representative for access to Nova-3.
- Use
- Significantly improves formatting for all transcripts, including in multilingual contexts. This new formatting is powered by a dedicated Named Entity Recognition (NER) model that performs sophisticated recognition of entities, including phone numbers, addresses, and dates, in order to return excellent formatted transcripts. NER is required for smart-formatted Nova-3 transcripts, and strongly suggested for all other speech-to-text transcripts.
- Consult our instructions for enabling NER formatting to make required updates to configuration as well as models.
- Improvements to smart-formatting have modified the formatting of some entity classes. Please contact your Deepgram account representative if you have any questions.
- Improves streaming smart-formatting.
- Improves non-English numeral formatting.
- Resolves an issue with Simplified Chinese (
zh
/zh-CN
) language requests. - Keeps our software up-to-date.
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-*
Quay repositories. For the next six months, both onprem-*
and self-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted
repositories. Subsequently, we will only publish new images to self-hosted-*
repos, deprecating onprem-*
repository variants.
Container Images Release
Container Images (release 250307)
-
quay.io/deepgram/self-hosted-api:release-250307
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.139.0
quay.io/deepgram/onprem-api:release-250307
quay.io/deepgram/onprem-api:1.139.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250307
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.84.0
quay.io/deepgram/onprem-engine:release-250307
quay.io/deepgram/onprem-engine:3.84.0
-
Minimum required NVIDIA driver version:
>=530.30.02
-
Maximum required NVIDIA driver version:
<=561.00.00
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250307
- It is essential to upgrade the license proxy before upgrading the API or Engine for this release. Neglecting to upgrade the license proxy will cause a breaking change on deployment.
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.8.0
quay.io/deepgram/onprem-license-proxy:release-250307
quay.io/deepgram/onprem-license-proxy:1.8.0
-
quay.io/deepgram/self-hosted-billing:release-250307
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.10.0
quay.io/deepgram/onprem-billing:release-250307
quay.io/deepgram/onprem-billing:1.10.0
- Equivalent image to:
This Release Contains The Following Changes
-
Adds support for our new class of English Nova-3 models, including both General and Medical. Use
model=nova-3
ormodel=nova-3-medical
. For further information, see our Model Selection documentation. Contact your Deepgram account representative for access to Nova-3. Please note that multilingual Nova-3 will be available in a later release. -
Significantly improves formatting for all English transcripts. This new formatting is powered by a dedicated Named Entity Recognition (NER) model that performs sophisticated recognition of entities, including phone numbers, addresses, and dates, in order to return excellent formatted transcripts. NER is required for smart-formatted Nova-3 transcripts, and strongly suggested for all other speech-to-text transcripts.
- Consult our instructions for enabling NER formatting to make required updates to configuration as well as models.
- Improvements to smart-formatting have modified the formatting of some entity classes. Please contact your Deepgram account representative if you have any questions.
-
Introduces self-hosted support for our Voice Agent API. Contact your Deepgram account representative to get started with it.
-
Adds support for numeral formatting in 10 new languages. Use
numerals=true
. -
Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the
self-hosted-*
Quay repositories. For the next six months, bothonprem-*
andself-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in theself-hosted
repositories. Subsequently, we will only publish new images toself-hosted-*
repos, deprecatingonprem-*
repository variants. -
Keeps our software up-to-date.
Container Images Release
Container Images (release 250130)
-
quay.io/deepgram/self-hosted-api:release-250130
- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.129.2
quay.io/deepgram/onprem-api:release-250130
quay.io/deepgram/onprem-api:1.129.2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-250130
-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.79.2
quay.io/deepgram/onprem-engine:release-250130
quay.io/deepgram/onprem-engine:3.79.2
-
Minimum required NVIDIA driver version:
>=530.30.02
-
Maximum required NVIDIA driver version:
<=561.00.00
-
-
quay.io/deepgram/self-hosted-license-proxy:release-250130
- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.7.0
quay.io/deepgram/onprem-license-proxy:release-250130
quay.io/deepgram/onprem-license-proxy:1.7.0
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-250130
- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.10.0
quay.io/deepgram/onprem-billing:release-250130
quay.io/deepgram/onprem-billing:1.10.0
- Equivalent image to:
This Release Contains The Following Changes
- Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the
self-hosted-*
Quay repositories. For the next seven months, bothonprem-*
andself-hosted-*
image repositories will receive identical image updates monthly, and we will announce image tags in theself-hosted
repositories. Subsequently, we will only publish new images toself-hosted-*
repos, deprecatingonprem-*
repository variants. - Keeps our software up-to-date.