Container Images Release

Deepgram Self-Hosted November 2025 Release (251118)

Container Images (release 251118)

  • quay.io/deepgram/self-hosted-api:release-251118

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.169.0
  • quay.io/deepgram/self-hosted-engine:release-251118

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.10
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251118

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.2
  • quay.io/deepgram/self-hosted-billing:release-251118

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Expands Nova-3 Monolingual Language Support — Nova-3 now supports 11 additional languages, bringing stronger accuracy and contextual understanding across:

    • Eastern Europe and Eurasia: Bulgarian (bg), Czech (cs), Hungarian (hu), Polish (pl), Russian (ru), Ukrainian (uk)
    • Nordics and Baltics: Finnish (fi)
    • South Asia: Hindi (hi)
    • East Asia: Japanese (ja), Korean (ko, ko-KR)
    • Southeast Asia: Vietnamese (vi)

    Learn more in our announcement blog.

  • Adds 36-Language Detection Model — Adds support for a new language detection model that handles 36 languages. This feature requires enabling the use_v2_language_detection feature flag in the Engine TOML configuration. Language detection is available for pre-recorded audio only. Learn more in the language detection documentation.

  • Updates Status Endpoint — Updates the /v1/status endpoint to better reflect node startup and runtime state, preventing false Critical reports when the API starts before an Engine driver is ready. See the status endpoint documentation for the new status flow:

    • Initializing — Reported during node startup; transitions to Ready once initialization completes.
    • Ready — The node can service requests; transitions to Healthy after enough successful requests, or Critical if errors occur.
    • Healthy — Sustained success; can transition to Critical if failures arise.
    • Critical — Indicates node failures; can recover back to Ready once node can service requests again.
  • Enhances API Graceful Shutdown — Resolves an issue where the API container would not properly wait for outstanding work to complete before shutting down. The graceful shutdown period now defaults to approximately 10 minutes.

  • Improves Address Formatting — Improves formatting for street numbers in addresses.

  • Improves Aura-2 Latency Consistency — Improves latency consistency for Aura-2 text-to-speech requests.

  • Deprecates Legacy Intelligence Features — Legacy Intelligence features (analyze_sentiment=true, detect_topics=true, summarize=v1, and summarize=true v1 structure) are now deprecated in favor of newer versions. Requests using these parameters will return HTTP 400 errors. Migration guidance:

    • analyze_sentiment=true → use sentiment=true
    • detect_topics=true → use topics=true
    • summarize=v1 → use summarize=true or summarize=v2

    See the Speech-to-Text changelog for more details.

  • Includes General Improvements — Keeps our software up-to-date.


Container Images Release

Deepgram Self-Hosted October 2025 Release (251029)

Container Images (release 251029)

  • quay.io/deepgram/self-hosted-api:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.167.1
  • quay.io/deepgram/self-hosted-engine:release-251029

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.7
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • General Improvements — Keeps our software up-to-date.

Container Images Release

Deepgram Self-Hosted October 2025 Release (251015)

Container Images (release 251015)

  • quay.io/deepgram/self-hosted-api:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.166.0
  • quay.io/deepgram/self-hosted-engine:release-251015

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.7
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Adds support for Flux, our new turn-based streaming speech-to-text model! Flux is purpose-built for voice agent use-cases.

    • Because of its resource usage, Flux must be run in its own self-hosted deployment separate from any other Deepgram models, even other speech-to-text models like Nova-3.
    • Connect to Flux’s new /v2/listen endpoint using model=flux-general-en, such as: http://localhost:8080/v2/listen?model=flux-general-en. Do not pass an additional language parameter.
    • Flux is currently English-only.
    • Contact your Deepgram account representative for access to the Flux model.
    • Get started with Flux’s developer docs and API reference.
    • Learn more in Flux’s changelog entry, Announcement Blog, and try our Interactive Demo.
    • To enable Flux in your self-hosted deployment, make the following additions to your TOML files:
      • api.toml:
        [features]
        listen_v2 = true
      • engine.toml:
        [flux]
        enabled = true
  • Resolves a Nova-3 diarization issue present in the prior release (release-250929).

  • General Improvements — Keeps our software up-to-date.


Container Images Release

Deepgram Self-Hosted September 2025 Release (250929)

Container Images (release 250929)

  • quay.io/deepgram/self-hosted-api:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.159.2
  • quay.io/deepgram/self-hosted-engine:release-250929

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.3
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

⚠️ Important: Please refrain from using this release version if you run Nova-3 diarization.

  • General Improvements — Keeps our software up-to-date.

Container Images Release

Deepgram Self-Hosted September 2025 Release (250912)

Container Images (release 250912)

  • quay.io/deepgram/self-hosted-api:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.159.2
  • quay.io/deepgram/self-hosted-engine:release-250912

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.101.1
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • GA support for Nova-3 monolingual German, Dutch, Swedish, Danish, Spanish, French, and Portuguese languages. Ask your Deepgram account representative for the new models.
  • Improves Utterance End settings to allow lower minimum utterance_end_ms values, aligning with configured step size for self-hosted deployments.
  • General Improvements — Keeps our software up-to-date.

Important: The onprem-* image tags are now deprecated and are no longer supported or updated. As announced in our previous release, we have transitioned to using only self-hosted-* repositories. Please update your deployments to use the self-hosted-* image tags listed above.


Container Images Release

Deepgram Self-Hosted August 2025 Release (250828)

Container Images (release 250828)

  • quay.io/deepgram/self-hosted-api:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.156.1
      • quay.io/deepgram/onprem-api:release-250828
      • quay.io/deepgram/onprem-api:1.156.1
  • quay.io/deepgram/self-hosted-engine:release-250828

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.100.0
      • quay.io/deepgram/onprem-engine:release-250828
      • quay.io/deepgram/onprem-engine:3.100.0
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250828
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250828
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • GPU Required Configuration — New [health] section in engine.toml with gpu_required configuration option allows Engine to fail on startup if no GPU is detected. While Engine can run without a GPU, production deployments require one for acceptable performance. Set gpu_required = true to fail fast if no GPU is available, rather than running with severely degraded performance. Default: false.
  • General Improvements — Keeps our software up-to-date.

Important: This is the last release that will include onprem-* image tags. The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Starting with the next release in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted August 2025 Release (250814)

Container Images (release 250814)

  • quay.io/deepgram/self-hosted-api:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.156.1
      • quay.io/deepgram/onprem-api:release-250814
      • quay.io/deepgram/onprem-api:1.156.1
  • quay.io/deepgram/self-hosted-engine:release-250814

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.97.2
      • quay.io/deepgram/onprem-engine:release-250814
      • quay.io/deepgram/onprem-engine:3.97.2
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250814
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250814
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Aura-2 Self-Hosted — Now fully supports Deepgram’s Aura-2 text-to-speech service. You can refer to the Deploy TTS Services documentation for more details.
  • Automatic Engine Shutdown on GPU Errors — Deepgram Engine will now automatically stop if a GPU error is detected, preventing potential data corruption. You can disable this by setting the environment variable IMPELLER_DISABLE_AUTOSTOP=true.
  • Smarter ZIP Code Formatting — Now supports proper formatting for alphanumeric postal codes (e.g., UK, Canada).
  • General Improvements — Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted July 2025 Release (250731)

Container Images (release 250731)

  • quay.io/deepgram/self-hosted-api:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.154.1
      • quay.io/deepgram/onprem-api:release-250731
      • quay.io/deepgram/onprem-api:1.154.1
  • quay.io/deepgram/self-hosted-engine:release-250731

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.94.0
      • quay.io/deepgram/onprem-engine:release-250731
      • quay.io/deepgram/onprem-engine:3.94.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250731
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250731
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds redact_usage functionality to redact the values of keyterms and other URL parameters. This is now enabled by default, and may be toggled via the redact_usage boolean feature flag in api.toml. See our redact usage documentation for more information.
  • Adds targeted support for CUDA 12.8. We recommend updating to the latest CUDA 12.8 release for optimal performance and stability. See our driver installation doc for guidance on updating to newer NVIDIA driver and CUDA toolkit versions.
  • Returns a 400 Unknown Model error with reference to our error documentation when NER is misconfigured.
  • Ensures presence of a word-level language tag on multilingual transcripts that apply smart formatting.
  • Improves smart formatting of dates, alphanumerics, and numbers with units.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted July 2025 Release (250710)

Container Images (release 250710)

  • quay.io/deepgram/self-hosted-api:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.151.8
      • quay.io/deepgram/onprem-api:release-250710
      • quay.io/deepgram/onprem-api:1.151.8
  • quay.io/deepgram/self-hosted-engine:release-250710

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250710
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250710
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250710
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds more verbose logging in Voice Agent for failures in TTS, LLM, and function-calling.
  • Improves redaction accuracy around entities and punctuation.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Container Images (release 250626)

  • quay.io/deepgram/self-hosted-api:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.150.2
      • quay.io/deepgram/onprem-api:release-250626
      • quay.io/deepgram/onprem-api:1.150.2
  • quay.io/deepgram/self-hosted-engine:release-250626

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250626
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250626
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250626
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Improves smart formatting for emails, alphanumerics, quantities, and percentages.
  • Expands language support for profanity filtering in German, Swiss German, Polish, Portuguese, Spanish, and Swedish.
  • Resolves an edge case when handling certain corrupt audio and now returns an HTTP 400 error code.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next three months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.