April 30, 2026

Deepgram Self-Hosted April 2026 Release (260430)

Container Images (release 260430)

quay.io/deepgram/self-hosted-api:release-260430
- Equivalent image to:
  - quay.io/deepgram/self-hosted-api:1.185.0-2
quay.io/deepgram/self-hosted-engine:release-260430
- Equivalent image to:
  - quay.io/deepgram/self-hosted-engine:3.116.0-1
- Minimum required NVIDIA driver version: >=570.172.08
quay.io/deepgram/self-hosted-license-proxy:release-260430
- Equivalent image to:
  - quay.io/deepgram/self-hosted-license-proxy:1.10.1
quay.io/deepgram/self-hosted-billing:release-260430
- Equivalent image to:
  - quay.io/deepgram/self-hosted-billing:1.13.0

Aura-2 Speed and Pronunciation Controls require an updated voice-pack

The new Aura-2 Speed and Pronunciation Control features in this release are powered by an updated Aura-2 English voice-pack model. If your deployment is using an Aura-2 English voice-pack from before the April 2026 release (e.g., the 2025-04-15.0 version of the voice-pack), requests including the speed or pronounce parameters will return 400 Bad Request.

To enable these features, contact your Deepgram representative to obtain the latest Aura-2 English voice-pack (2025-04-15.4 or later) and replace the existing voice-pack file in your models directory. The official Deepgram Helm chart and sample values files in deepgram/self-hosted-resources (chart 0.34.0 and later) already point to the correct UUID; you only need to use the latest Deepgram configuration files and update the model file on disk.

This Release Contains The Following Changes

Nova-3 Gujarati — Nova-3 now supports Gujarati (gu) for both batch and streaming.
Aura-2 Speed and Pronunciation Controls — Aura-2 TTS voices now support runtime speed and pronunciation control. See Voice Controls for details.
Improved Aura-2 Pronunciation — Better pronunciation for Spanish dates and the term “Jan” (as a name versus a month) with Aura-2 voices.
Nova-3 Multilingual Numeral Formatting — Numeral formatting is now applied when using Nova-3 multilingual models and smart_format or numerals is enabled.
Numeral Formatting for Hebrew and Romanian — Numeral formatting is now applied for Nova-3 Hebrew (he) and Romanian (ro) when smart_format or numerals is enabled.
Voice Agent: Cartesia Speed Control — The Cartesia speak provider now supports speed control in Voice Agent sessions.
Voice Agent: Improved Agent Message Injection — Improved support for injecting agent messages into a live session. See Inject Agent for details.
Voice Agent: Multilingual Flux Language Hints — Multilingual Flux now accepts language hints when used as the STT provider in a Voice Agent session.
Improved Multilingual Streaming Language Tags — Improves the accuracy of language tag results on /v1/listen streaming requests using multilingual models.
Improved Numeral Redaction Accuracy — Improved redaction accuracy when using redact=numbers or redact=aggressive_numbers.
General Improvements — Keeps our software up-to-date.