April 2, 2026

Deepgram Self-Hosted April 2026 Release (260402)

Container Images (release 260402)

  • quay.io/deepgram/self-hosted-api:release-260402

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.181.3
  • quay.io/deepgram/self-hosted-engine:release-260402

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.114.5
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-260402

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1
  • quay.io/deepgram/self-hosted-billing:release-260402

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.13.0

This Release Contains The Following Changes

  • Certificate Endpoint Fix — Engine now responds to /v1/certificates in addition to /certificates, consistent with the other container images. See Certificate Status for details.
  • Model Name Consistency — The /v1/models endpoint now returns a canonical_name field matching the model name used in /v1/listen requests.
  • General Improvements — Keeps our software up-to-date.

March 31, 2026

Nova-3 Model Update

🌏 Nova-3 now supports the following new languages and language codes:

  • Chinese (Mandarin, Simplified): zh, zh-CN, zh-Hans
  • Chinese (Mandarin, Traditional): zh-TW, zh-Hant

Access these models by setting model="nova-3" and the relevant language code in your request.

Learn more about Nova-3 and supported languages on the Models and Language Overview page.


March 26, 2026

TTS speed controls & updated LLM models

TTS speak speed (Early Access)

You can now control the speaking rate of Deepgram TTS in the Voice Agent API using the agent.speak.provider.speed parameter. This parameter accepts a float value between 0.7 and 1.5, with 1.0 as the default.

1{
2 "type": "Settings",
3 "agent": {
4 "speak": {
5 "provider": {
6 "type": "deepgram",
7 "model": "aura-2-thalia-en",
8 "speed": 0.9
9 }
10 }
11 }
12}

This feature is in Early Access and is only available for Deepgram TTS. For more details, see TTS voice controls. To request access, contact your Account Executive or reach out to sales@deepgram.com.

Updated LLM models

New OpenAI models — Two new models are now available in the Standard pricing tier:

  • gpt-5.4-nano
  • gpt-5.4-mini

Gemini 2.0 Flash deprecated — The gemini-2.0-flash model is now deprecated. We recommend migrating to gemini-2.5-flash or a newer Gemini model. See the Google models table for alternatives.

For the full list of supported models and pricing tiers, see the Voice Agent LLM Models documentation.


March 19, 2026

Deepgram Self-Hosted March 2026 Release (260319)

Container Images (release 260319)

  • quay.io/deepgram/self-hosted-api:release-260319

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.180.1
  • quay.io/deepgram/self-hosted-engine:release-260319

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.114.4
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-260319

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1
  • quay.io/deepgram/self-hosted-billing:release-260319

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.13.0

This Release Contains The Following Changes

  • Flux Regression Fix — Resolves Flux support regression from the 260305 release. See Deploy Flux Model (STT) for deployment details.
  • Nova-3 Language Expansion — New models: Thai (th, th-TH), Chinese Cantonese Traditional (zh-HK). Improved models: Bengali (bn), Marathi (mr), Tamil (ta), Telugu (te). See the full announcement for details.
  • Flux Status Metrics — Self-hosted status endpoint now includes Flux stream metrics. See Status Endpoint for details.
  • Certificate Status Endpoint — New /v1/certificates endpoint on all container images returns beginning-of-support, end-of-support, and end-of-life dates. See Certificate Status for details.
  • Log Formats — New configurable log output formats: Full, Compact, Pretty, Json. See Log Formats for configuration details.
  • General Improvements — Keeps our software up-to-date.

March 17, 2026

Nova-3 Model Update

🌏 Nova-3 now supports the following new languages and language codes:

  • Chinese (Cantonese, Traditional): zh-HK
  • Thai: th, th-TH

🚀 Also releasing improved Nova-3 models for the following languages:

  • Bengali (bn)
  • Marathi (mr)
  • Tamil (ta)
  • Telugu (te)

Access these models by setting model="nova-3" and the relevant language code in your request.

Learn more about Nova-3 and supported languages on the Models and Language Overview page.


March 16, 2026

🤖 New LLM Models Support & Bug Fixes

We’ve added support for new LLM models in the Voice Agent API:

  • OpenAI GPT-5.3 Instant (gpt-5.3-chat-latest)
  • OpenAI GPT 5.4 (gpt-5.4)
  • Google Gemini 3.1 Flash Lite (gemini-3.1-flash-lite)

Example:

1{
2 "type": "Settings",
3 "agent": {
4 "think": {
5 "provider": {
6 "type": "open_ai",
7 "model": "gpt-5.3-chat-latest"
8 }
9 }
10 }
11}

For the full list of supported models and pricing tiers, visit our Voice Agent LLM Models documentation.

Fixes

  • Resolves an issue where the GPT-5.2 Instant model used an incorrect model ID and pricing tier. The model now uses the correct ID (gpt-5.2-chat-latest) and is assigned to the Advanced tier.

March 10, 2026

Nova-3 Model Update

🎯 Nova-3 Swedish and Dutch Model Enhancements

We’ve released updated Nova-3 Swedish and Nova-3 Dutch models, offering improved accuracy for both streaming and batch transcription.

Access these models by setting model: "nova-3" and the relevant language code:

  • Swedish (sv, sv-SE)
  • Dutch (nl)

Learn more about Nova-3 on the Models and Language Overview page.


March 9, 2026

Reasoning mode for OpenAI thinking models

You can now control the reasoning effort of supported OpenAI reasoning models using the new reasoning_mode parameter in the think provider configuration. This parameter maps to OpenAI’s reasoning_effort and accepts low, medium, or high.

Example:

1{
2 "type": "Settings",
3 "agent": {
4 "think": {
5 "provider": {
6 "type": "open_ai",
7 "model": "gpt-5",
8 "reasoning_mode": "medium"
9 }
10 }
11 }
12}

For more details, visit the Configure the Voice Agent documentation.


Model Improvement Program pricing update

Pay as you Go and Growth customers can now opt in or out of the Model Improvement Program with no impact on the rates listed on deepgram.com/pricing.


March 5, 2026

Deepgram Self-Hosted March 2026 Release (260305)

We are aware of an issue with Flux in this release. Do not use this release for Flux deployments. This is resolved in the 260319 release.

Container Images (release 260305)

  • quay.io/deepgram/self-hosted-api:release-260305

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.179.5
  • quay.io/deepgram/self-hosted-engine:release-260305

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.113.2
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-260305

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1
  • quay.io/deepgram/self-hosted-billing:release-260305

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Nova-3 Right-to-Left Language Support — Nova-3 now supports Arabic, Hebrew, Farsi, and Urdu. See the full announcement for details.
  • Nova-3 Multilingual Model Update — Accuracy improvements across all supported languages, with the largest gains in code-switching scenarios. See the full announcement for details.
  • Abbreviated Dates in Smart Formatting — Smart formatting now recognizes and formats abbreviated dates.
  • General Improvements — Keeps our software up-to-date.