June 30, 2026
Deepgram Self-Hosted June 2026 Release (260630)
Container Images (release 260630)
-
quay.io/deepgram/self-hosted-api:release-260630- Equivalent image to:
quay.io/deepgram/self-hosted-api:1.192.3-1
- Equivalent image to:
-
quay.io/deepgram/self-hosted-engine:release-260630-
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.119.1-1
-
Minimum required NVIDIA driver version:
>=570.172.08
-
-
quay.io/deepgram/self-hosted-license-proxy:release-260630- Equivalent image to:
quay.io/deepgram/self-hosted-license-proxy:1.10.1-2
- Equivalent image to:
-
quay.io/deepgram/self-hosted-billing:release-260630- Equivalent image to:
quay.io/deepgram/self-hosted-billing:1.13.0
- Equivalent image to:
Breaking Change: GET /v1/models Response Format
The self-hosted GET /v1/models endpoint now lists every loaded model, not just core speech-to-text (Nova) models. The response now also includes Text-to-Speech (Aura voice packs and voices) and helper models (entity detection/NER, diarization), groups results by model type, and indicates batch versus streaming mode for each speech-to-text model.
As part of this change, the response shape changes from a flat list to an object grouped by model type. Any integration that parses the /v1/models response must be updated to handle the new structure before upgrading to release-260630. See Model Metadata for the current response format.
This Release Contains The Following Changes
- Flux Word-Level Timestamps β each word in the
wordsarray now includesstartandendtimes on the Flux streaming endpoint (/v2/listen). This is an additive change. If your client library rejects unrecognized response fields, update to a current SDK version β for the Deepgram Python SDK, use6.0.1or later, or5.3.4if you are on the5.xline. See Flux Feature Overview for details. - Spanish Number Formatting Fix β βveintiunoβ now correctly formats as
21in Spanish transcripts (Nova-3 and Nova-2, including multilingual). - Additional Number and Currency Formatting β numeral expansion now applies to Flux multilingual transcripts; Romanian and Russian ordinal suffixes are supported; and currency (for example,
Bn$,Mn$,Tn$,Th$) and fiscal-quarter expressions format more consistently. - Streaming Timestamp Accuracy β corrected utterance timestamp ordering and improved word
startandendtime accuracy on streaming requests. - General Improvements β Keeps our software up-to-date.