Container Images Release

Deepgram Self-Hosted October 2025 Release (251029)

Container Images (release 251029)

  • quay.io/deepgram/self-hosted-api:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.167.1
  • quay.io/deepgram/self-hosted-engine:release-251029

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.7
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-251029

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • General Improvements — Keeps our software up-to-date.
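
The Engine image above lists a minimum NVIDIA driver version. As a rough sketch of how a pre-upgrade check might compare an installed driver against that minimum, the snippet below parses dotted version strings into integer tuples. The function names are hypothetical, and the sample inputs are illustrative only. The installed version is assumed to be supplied by the caller (for example, read from the output of nvidia-smi --query-gpu=driver_version --format=csv,noheader).

```python
# Minimum taken from the release notes for release-251029 and release-251015.
MINIMUM_DRIVER = "570.172.08"


def parse_driver_version(version: str) -> tuple:
    """Split a dotted driver version such as '570.172.08' into a tuple of ints."""
    return tuple(int(part) for part in version.strip().split("."))


def driver_meets_minimum(installed: str, minimum: str = MINIMUM_DRIVER) -> bool:
    """Return True when the installed driver is at or above the minimum."""
    return parse_driver_version(installed) >= parse_driver_version(minimum)


if __name__ == "__main__":
    # Illustrative inputs: the older minimum, the current minimum, a newer value.
    for candidate in ("550.163.01", "570.172.08", "600.10.01"):
        status = "ok" if driver_meets_minimum(candidate) else "too old"
        print(f"{candidate}: {status}")
```

Comparing integer tuples rather than raw strings avoids ordering mistakes when components have different digit counts.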

Container Images Release

Deepgram Self-Hosted October 2025 Release (251015)

Container Images (release 251015)

  • quay.io/deepgram/self-hosted-api:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.166.0
  • quay.io/deepgram/self-hosted-engine:release-251015

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.7
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Adds support for Flux, our new turn-based streaming speech-to-text model. Flux is purpose-built for voice agent use cases.

    • Because of its resource usage, Flux must be run in its own self-hosted deployment separate from any other Deepgram models, even other speech-to-text models like Nova-3.
    • Connect to Flux’s new /v2/listen endpoint using model=flux-general-en, for example: http://localhost:8080/v2/listen?model=flux-general-en. Do not pass an additional language parameter.
    • Flux is currently English-only.
    • Contact your Deepgram account representative for access to the Flux model.
    • Get started with Flux’s developer docs and API reference.
    • Learn more in Flux’s changelog entry and announcement blog post, and try our interactive demo.
    • To enable Flux in your self-hosted deployment, make the following additions to your TOML files:
      • api.toml:
        [features]
        listen_v2 = true
      • engine.toml:
        [flux]
        enabled = true
  • Resolves a Nova-3 diarization issue present in the prior release (release-250929).

  • General Improvements — Keeps our software up-to-date.
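
The Flux connection rules above (the /v2/listen path, model=flux-general-en, and no language parameter) can be sketched as a small URL builder. This is an illustration only: build_flux_url and its extra_params argument are hypothetical names, and the default base URL simply mirrors the localhost example in the notes.

```python
from urllib.parse import urlencode

FLUX_MODEL = "flux-general-en"  # model name given in the release notes


def build_flux_url(base_url="http://localhost:8080", extra_params=None):
    """Build a /v2/listen URL for Flux, rejecting a language parameter."""
    params = dict(extra_params or {})
    if "language" in params:
        # The notes say not to pass an additional language parameter.
        raise ValueError("do not pass a language parameter with Flux")
    params["model"] = FLUX_MODEL  # always request the Flux model
    return f"{base_url.rstrip('/')}/v2/listen?{urlencode(params)}"


if __name__ == "__main__":
    print(build_flux_url())
```

Rejecting the language parameter in one place keeps the English-only constraint from being bypassed by callers that reuse query parameters from other models.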


Container Images Release

Deepgram Self-Hosted September 2025 Release (250929)

Container Images (release 250929)

  • quay.io/deepgram/self-hosted-api:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.159.2
  • quay.io/deepgram/self-hosted-engine:release-250929

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.3
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-250929

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

⚠️ Important: Do not use this release if you run Nova-3 diarization. The issue is resolved in release-251015.

  • General Improvements — Keeps our software up-to-date.

Container Images Release

Deepgram Self-Hosted September 2025 Release (250912)

Container Images (release 250912)

  • quay.io/deepgram/self-hosted-api:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.159.2
  • quay.io/deepgram/self-hosted-engine:release-250912

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.101.1
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-250912

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Adds GA support for Nova-3 monolingual models in German, Dutch, Swedish, Danish, Spanish, French, and Portuguese. Ask your Deepgram account representative for the new models.
  • Improves Utterance End settings to allow lower minimum utterance_end_ms values, aligning with the configured step size for self-hosted deployments.
  • General Improvements — Keeps our software up-to-date.

Important: The onprem-* image tags are now deprecated and are no longer supported or updated. As announced in our previous release, we have transitioned to using only self-hosted-* repositories. Please update your deployments to use the self-hosted-* image tags listed above.
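
As one possible way to find and update old references in deployment files, the sketch below rewrites onprem-* repository names to their self-hosted-* equivalents. The helper name to_self_hosted is hypothetical. It changes only the repository name, so a tag from an older release still needs to be bumped by hand to one of the releases listed above.

```python
import re

# Matches the four Deepgram repositories named in these release notes.
_ONPREM_REPO = re.compile(
    r"^(quay\.io/deepgram/)onprem-(api|engine|license-proxy|billing)(?=[:@]|$)"
)


def to_self_hosted(image_ref: str) -> str:
    """Rewrite an onprem-* image reference to its self-hosted-* repository."""
    return _ONPREM_REPO.sub(r"\1self-hosted-\2", image_ref)


if __name__ == "__main__":
    for ref in (
        "quay.io/deepgram/onprem-api:release-250828",
        "quay.io/deepgram/onprem-license-proxy:1.8.0",
        "quay.io/deepgram/self-hosted-engine:release-250912",
    ):
        print(ref, "->", to_self_hosted(ref))
```

References that already point at a self-hosted-* repository pass through unchanged, so the helper can be run repeatedly over the same files.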


Container Images Release

Deepgram Self-Hosted August 2025 Release (250828)

Container Images (release 250828)

  • quay.io/deepgram/self-hosted-api:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.156.1
      • quay.io/deepgram/onprem-api:release-250828
      • quay.io/deepgram/onprem-api:1.156.1
  • quay.io/deepgram/self-hosted-engine:release-250828

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.100.0
      • quay.io/deepgram/onprem-engine:release-250828
      • quay.io/deepgram/onprem-engine:3.100.0
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250828
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250828

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250828
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • GPU Required Configuration — New [health] section in engine.toml with gpu_required configuration option allows Engine to fail on startup if no GPU is detected. While Engine can run without a GPU, production deployments require one for acceptable performance. Set gpu_required = true to fail fast if no GPU is available, rather than running with severely degraded performance. Default: false.
  • General Improvements — Keeps our software up-to-date.
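
To illustrate the gpu_required option described above, here is a minimal sketch that appends a [health] section to the text of an existing engine.toml. The section and key names come from the release note. The helper name with_gpu_required is hypothetical, and the sketch assumes [health] is a top-level table that can be placed at the end of the file.

```python
def with_gpu_required(engine_toml_text: str) -> str:
    """Return engine.toml text with a [health] section enabling gpu_required."""
    lines = engine_toml_text.splitlines()
    if any(line.strip() == "[health]" for line in lines):
        # Avoid emitting a duplicate table; leave existing settings to the operator.
        raise ValueError("engine.toml already has a [health] section")
    return engine_toml_text.rstrip("\n") + "\n\n[health]\ngpu_required = true\n"


if __name__ == "__main__":
    existing = "[flux]\nenabled = true\n"  # sample content for the demo
    print(with_gpu_required(existing))
```

With gpu_required = true, Engine fails on startup when no GPU is detected instead of running with severely degraded performance.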

Important: This is the last release that will include onprem-* image tags. The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Starting with the next release in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted August 2025 Release (250814)

Container Images (release 250814)

  • quay.io/deepgram/self-hosted-api:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.156.1
      • quay.io/deepgram/onprem-api:release-250814
      • quay.io/deepgram/onprem-api:1.156.1
  • quay.io/deepgram/self-hosted-engine:release-250814

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.97.2
      • quay.io/deepgram/onprem-engine:release-250814
      • quay.io/deepgram/onprem-engine:3.97.2
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250814
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250814

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250814
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Aura-2 Self-Hosted — Now fully supports Deepgram’s Aura-2 text-to-speech service. You can refer to the Deploy TTS Services documentation for more details.
  • Automatic Engine Shutdown on GPU Errors — Deepgram Engine will now automatically stop if a GPU error is detected, preventing potential data corruption. You can disable this by setting the environment variable IMPELLER_DISABLE_AUTOSTOP=true.
  • Smarter ZIP Code Formatting — Now supports proper formatting for alphanumeric postal codes (e.g., UK, Canada).
  • General Improvements — Keeps our software up-to-date.
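
The automatic shutdown described above is controlled by an environment variable. The sketch below shows one way that variable might be passed to an Engine container started with docker run. It is not a complete or official launch command: the helper name engine_run_args is hypothetical, and configuration mounts, ports, and other settings a real deployment needs are omitted.

```python
def engine_run_args(image: str, disable_autostop: bool = False) -> list:
    """Build a docker run argument list for an Engine container (sketch only)."""
    args = ["docker", "run", "--rm", "--gpus", "all"]
    if disable_autostop:
        # Variable name and value taken from the release note.
        args += ["-e", "IMPELLER_DISABLE_AUTOSTOP=true"]
    args.append(image)
    return args


if __name__ == "__main__":
    image = "quay.io/deepgram/self-hosted-engine:release-250814"
    print(" ".join(engine_run_args(image, disable_autostop=True)))
```

Leaving the variable unset keeps the new default behavior, in which Engine stops automatically when a GPU error is detected.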

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted July 2025 Release (250731)

Container Images (release 250731)

  • quay.io/deepgram/self-hosted-api:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.154.1
      • quay.io/deepgram/onprem-api:release-250731
      • quay.io/deepgram/onprem-api:1.154.1
  • quay.io/deepgram/self-hosted-engine:release-250731

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.94.0
      • quay.io/deepgram/onprem-engine:release-250731
      • quay.io/deepgram/onprem-engine:3.94.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250731
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250731
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds redact_usage functionality to redact the values of keyterms and other URL parameters. This is now enabled by default, and may be toggled via the redact_usage boolean feature flag in api.toml. See our redact usage documentation for more information.
  • Adds targeted support for CUDA 12.8. We recommend updating to the latest CUDA 12.8 release for optimal performance and stability. See our driver installation doc for guidance on updating to newer NVIDIA driver and CUDA toolkit versions.
  • Returns a 400 Unknown Model error with reference to our error documentation when NER is misconfigured.
  • Ensures presence of a word-level language tag on multilingual transcripts that apply smart formatting.
  • Improves smart formatting of dates, alphanumerics, and numbers with units.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Deepgram Self-Hosted July 2025 Release (250710)

Container Images (release 250710)

  • quay.io/deepgram/self-hosted-api:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.151.8
      • quay.io/deepgram/onprem-api:release-250710
      • quay.io/deepgram/onprem-api:1.151.8
  • quay.io/deepgram/self-hosted-engine:release-250710

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250710
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250710
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250710
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds more verbose logging in Voice Agent for failures in TTS, LLM, and function-calling.
  • Improves redaction accuracy around entities and punctuation.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Container Images (release 250626)

  • quay.io/deepgram/self-hosted-api:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.150.2
      • quay.io/deepgram/onprem-api:release-250626
      • quay.io/deepgram/onprem-api:1.150.2
  • quay.io/deepgram/self-hosted-engine:release-250626

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250626
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250626
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250626
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Improves smart formatting for emails, alphanumerics, quantities, and percentages.
  • Expands language support for profanity filtering in German, Swiss German, Polish, Portuguese, Spanish, and Swedish.
  • Resolves an edge case in handling certain corrupt audio. Such requests now return an HTTP 400 error code.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next three months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.


Container Images Release

Container Images (release 250610)

  • quay.io/deepgram/self-hosted-api:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.146.1
      • quay.io/deepgram/onprem-api:release-250610
      • quay.io/deepgram/onprem-api:1.146.1
  • quay.io/deepgram/self-hosted-engine:release-250610

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.89.2
      • quay.io/deepgram/onprem-engine:release-250610
      • quay.io/deepgram/onprem-engine:3.89.2
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250610
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250610
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds full support for Voice Agent v1 API.
  • Addresses an issue with the engine_active_requests metric for streaming STT auto-scaling.
  • Resolves an issue with year formatting for smart_format.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next four months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.