April 30, 2026
Deepgram Self-Hosted April 2026 Release (260430)
Container Images (release 260430)
-
quay.io/deepgram/self-hosted-api:release-260430- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.185.0-2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260430-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.116.0-1
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260430- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.10.1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260430- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.13.0
- Equivalent image to:
Aura-2 Speed and Pronunciation Controls require an updated voice-pack
The new Aura-2 Speed and Pronunciation Control features in this release are powered by an updated Aura-2 English voice-pack model. If your deployment is using an Aura-2 English voice-pack from before the April 2026 release (e.g., the 2025-04-15.0 version of the voice-pack), requests including the speed or pronounce parameters will return 400 Bad Request.
To enable these features, contact your Deepgram representative to obtain the latest Aura-2 English voice-pack (2025-04-15.4 or later) and replace the existing voice-pack file in your models directory. The official Deepgram Helm chart and sample values files in deepgram/self-hosted-resources (chart 0.34.0 and later) already point to the correct UUID; you only need to use the latest Deepgram configuration files and update the model file on disk.
This Release Contains The Following Changes
- Nova-3 Gujarati — Nova-3 now supports Gujarati (
gu) for both batch and streaming. - Aura-2 Speed and Pronunciation Controls — Aura-2 TTS voices now support runtime speed and pronunciation control. See Voice Controls for details.
- Improved Aura-2 Pronunciation — Better pronunciation for Spanish dates and the term “Jan” (as a name versus a month) with Aura-2 voices.
- Nova-3 Multilingual Numeral Formatting — Numeral formatting is now applied when using Nova-3 multilingual models and
smart_formatornumeralsis enabled. - Numeral Formatting for Hebrew and Romanian — Numeral formatting is now applied for Nova-3 Hebrew (
he) and Romanian (ro) whensmart_formatornumeralsis enabled. - Voice Agent: Cartesia Speed Control — The Cartesia speak provider now supports speed control in Voice Agent sessions.
- Voice Agent: Improved Agent Message Injection — Improved support for injecting agent messages into a live session. See Inject Agent for details.
- Voice Agent: Multilingual Flux Language Hints — Multilingual Flux now accepts language hints when used as the STT provider in a Voice Agent session.
- Improved Multilingual Streaming Language Tags — Improves the accuracy of language tag results on
/v1/listenstreaming requests using multilingual models. - Improved Numeral Redaction Accuracy — Improved redaction accuracy when using
redact=numbersorredact=aggressive_numbers. - General Improvements — Keeps our software up-to-date.