Container Images Release

Deepgram Self-Hosted July 2025 Release (250731)

Container Images (release 250731)

  • quay.io/deepgram/self-hosted-api:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.154.1
      • quay.io/deepgram/onprem-api:release-250731
      • quay.io/deepgram/onprem-api:1.154.1
  • quay.io/deepgram/self-hosted-engine:release-250731

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.94.0
      • quay.io/deepgram/onprem-engine:release-250731
      • quay.io/deepgram/onprem-engine:3.94.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250731
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250731

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250731
      • quay.io/deepgram/onprem-billing:1.11.2
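
The six-digit stamp in each release tag appears to encode the release date as YYMMDD (for example, 250731 under the "July 2025 Release" heading). That reading is an inference from the headings in these notes, not a documented convention. Under that assumption, a minimal Python sketch for turning an image reference into a date could look like this; the function name release_date is hypothetical.

```python
from datetime import date, datetime


def release_date(image_ref: str) -> date:
    """Return the date encoded in a ':release-YYMMDD' image tag.

    Assumes the stamp is YYMMDD, which is inferred from the release
    headings rather than stated anywhere in the notes.
    """
    # Everything after the last ':' is the tag.
    _, _, tag = image_ref.rpartition(":")
    prefix = "release-"
    if not tag.startswith(prefix):
        raise ValueError(f"not a release tag: {tag!r}")
    return datetime.strptime(tag[len(prefix):], "%y%m%d").date()


print(release_date("quay.io/deepgram/self-hosted-api:release-250731"))  # 2025-07-31
```

Version tags such as 1.154.1 carry no date, so the sketch rejects them with a ValueError instead of guessing.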

This Release Contains The Following Changes

  • Adds redact_usage functionality to redact the values of keyterms and other URL parameters. This is now enabled by default, and may be toggled via the redact_usage boolean feature flag in api.toml. See our redact usage documentation for more information.
  • Adds targeted support for CUDA 12.8. We recommend updating to the latest CUDA 12.8 release for optimal performance and stability. See our driver installation doc for guidance on updating to newer NVIDIA driver and CUDA toolkit versions.
  • Returns a 400 Unknown Model error, with a reference to our error documentation, when NER is misconfigured.
  • Ensures presence of a word-level language tag on multilingual transcripts that apply smart formatting.
  • Improves smart formatting of dates, alphanumerics, and numbers with units.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.
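
For deployments that still reference onprem-* images, the repository rename described above is a prefix swap, because each release lists the same tags in both repositories. The following Python sketch rewrites an image reference accordingly; the helper name to_self_hosted is hypothetical, and the sketch assumes references use the full quay.io/deepgram/ path as shown in these notes.

```python
OLD_PREFIX = "quay.io/deepgram/onprem-"
NEW_PREFIX = "quay.io/deepgram/self-hosted-"


def to_self_hosted(image_ref: str) -> str:
    """Rewrite an onprem-* image reference to its self-hosted-* equivalent.

    The tag (everything after the repository name) is kept unchanged.
    References that do not start with the onprem-* prefix are returned as-is.
    """
    if image_ref.startswith(OLD_PREFIX):
        return NEW_PREFIX + image_ref[len(OLD_PREFIX):]
    return image_ref


print(to_self_hosted("quay.io/deepgram/onprem-engine:release-250731"))
# quay.io/deepgram/self-hosted-engine:release-250731
```

Running such a rewrite over compose files or Helm values before September 2025 would avoid pinning to repositories that stop receiving new images.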

Container Images Release

Deepgram Self-Hosted July 2025 Release (250710)

Container Images (release 250710)

  • quay.io/deepgram/self-hosted-api:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.151.8
      • quay.io/deepgram/onprem-api:release-250710
      • quay.io/deepgram/onprem-api:1.151.8
  • quay.io/deepgram/self-hosted-engine:release-250710

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250710
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250710
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250710

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250710
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds more verbose logging in Voice Agent for failures in TTS, LLM, and function-calling.
  • Improves redaction accuracy around entities and punctuation.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Through August 2025, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Starting in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

Container Images Release

Container Images (release 250626)

  • quay.io/deepgram/self-hosted-api:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.150.2
      • quay.io/deepgram/onprem-api:release-250626
      • quay.io/deepgram/onprem-api:1.150.2
  • quay.io/deepgram/self-hosted-engine:release-250626

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.91.0
      • quay.io/deepgram/onprem-engine:release-250626
      • quay.io/deepgram/onprem-engine:3.91.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250626
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250626

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250626
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Improves smart formatting for emails, alphanumerics, quantities, and percentages.
  • Expands language support for profanity filtering in German, Swiss German, Polish, Portuguese, Spanish, and Swedish.
  • Resolves an edge case in the handling of certain corrupt audio; the API now returns an HTTP 400 error code.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next three months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

Container Images Release

Container Images (release 250610)

  • quay.io/deepgram/self-hosted-api:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.146.1
      • quay.io/deepgram/onprem-api:release-250610
      • quay.io/deepgram/onprem-api:1.146.1
  • quay.io/deepgram/self-hosted-engine:release-250610

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.89.2
      • quay.io/deepgram/onprem-engine:release-250610
      • quay.io/deepgram/onprem-engine:3.89.2
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250610
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250610

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250610
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Adds full support for Voice Agent v1 API.
  • Addresses an issue with the engine_active_requests metric for streaming STT auto-scaling.
  • Resolves an issue with year formatting for smart_format.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next four months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

Container Images Release

Container Images (release 250505)

  • quay.io/deepgram/self-hosted-api:release-250505

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.142.1
      • quay.io/deepgram/onprem-api:release-250505
      • quay.io/deepgram/onprem-api:1.142.1
  • quay.io/deepgram/self-hosted-engine:release-250505

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.89.0
      • quay.io/deepgram/onprem-engine:release-250505
      • quay.io/deepgram/onprem-engine:3.89.0
    • Minimum required NVIDIA driver version: >=550.163.01

  • quay.io/deepgram/self-hosted-license-proxy:release-250505

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250505
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250505

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.11.2
      • quay.io/deepgram/onprem-billing:release-250505
      • quay.io/deepgram/onprem-billing:1.11.2

This Release Contains The Following Changes

  • Extends numeral formatting for supported languages when using detect_language=true.
  • Improves formatting of dates.
  • Resolves an issue with Whisper functionality.
  • Keeps our software up-to-date.

Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next five months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

Container Images Release

Container Images (release 250307)

  • quay.io/deepgram/self-hosted-api:release-250307

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.139.0
      • quay.io/deepgram/onprem-api:release-250307
      • quay.io/deepgram/onprem-api:1.139.0
  • quay.io/deepgram/self-hosted-engine:release-250307

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.84.0
      • quay.io/deepgram/onprem-engine:release-250307
      • quay.io/deepgram/onprem-engine:3.84.0
    • Minimum required NVIDIA driver version: >=530.30.02

    • Maximum supported NVIDIA driver version: <=561.00.00

  • quay.io/deepgram/self-hosted-license-proxy:release-250307

    • For this release, you must upgrade the license proxy before upgrading the API or Engine. Upgrading the API or Engine first will break the deployment.
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.8.0
      • quay.io/deepgram/onprem-license-proxy:release-250307
      • quay.io/deepgram/onprem-license-proxy:1.8.0
  • quay.io/deepgram/self-hosted-billing:release-250307

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.10.0
      • quay.io/deepgram/onprem-billing:release-250307
      • quay.io/deepgram/onprem-billing:1.10.0
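
The Engine driver bounds listed for this release can be checked before an upgrade by comparing dotted version strings numerically. This is a minimal Python sketch; the function names are hypothetical, and it assumes driver versions are purely numeric, dot-separated strings such as those shown above.

```python
from typing import Optional, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Split a dotted version like '550.163.01' into a tuple of integers."""
    return tuple(int(part) for part in text.split("."))


def driver_supported(installed: str, minimum: str, maximum: Optional[str] = None) -> bool:
    """Return True if installed lies within [minimum, maximum] (both inclusive)."""
    version = parse_version(installed)
    if version < parse_version(minimum):
        return False
    if maximum is not None and version > parse_version(maximum):
        return False
    return True


# Bounds for release 250307: >=530.30.02 and <=561.00.00
print(driver_supported("550.163.01", "530.30.02", "561.00.00"))  # True
print(driver_supported("525.60.13", "530.30.02", "561.00.00"))   # False
```

Comparing integer tuples avoids the pitfalls of string comparison, where "99.0.0" would sort above "550.0.0". The installed version can typically be read with nvidia-smi --query-gpu=driver_version --format=csv,noheader.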

This Release Contains The Following Changes

  • Adds support for our new class of English Nova-3 models, including both General and Medical. Use model=nova-3 or model=nova-3-medical. For further information, see our Model Selection documentation. Contact your Deepgram account representative for access to Nova-3. Please note that multilingual Nova-3 will be available in a later release.

  • Significantly improves formatting for all English transcripts. This new formatting is powered by a dedicated Named Entity Recognition (NER) model that recognizes entities such as phone numbers, addresses, and dates in order to return well-formatted transcripts. NER is required for smart-formatted Nova-3 transcripts, and strongly recommended for all other speech-to-text transcripts.

    • Consult our instructions for enabling NER formatting to make required updates to configuration as well as models.
    • Improvements to smart-formatting have modified the formatting of some entity classes. Please contact your Deepgram account representative if you have any questions.
  • Introduces self-hosted support for our Voice Agent API. Contact your Deepgram account representative to get started with it.

  • Adds support for numeral formatting in 10 new languages. Use numerals=true.

  • Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next six months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

  • Keeps our software up-to-date.
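
The request parameters named in this release (model=nova-3, model=nova-3-medical, numerals=true) are passed as URL query parameters. The Python sketch below only builds such a URL and sends nothing. The base address http://localhost:8080 and the /v1/listen path are assumptions about a typical deployment and are not taken from these notes; substitute the values for your own environment.

```python
from urllib.parse import urlencode


def listen_url(base_url: str, **params: str) -> str:
    """Build a transcription request URL with the given query parameters.

    The '/v1/listen' path is an assumption, not something stated in the notes.
    """
    return f"{base_url.rstrip('/')}/v1/listen?{urlencode(params)}"


# Hypothetical self-hosted API address; adjust host and port as needed.
print(listen_url("http://localhost:8080", model="nova-3-medical", numerals="true"))
# http://localhost:8080/v1/listen?model=nova-3-medical&numerals=true
```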

Container Images Release

Container Images (release 250130)

  • quay.io/deepgram/self-hosted-api:release-250130

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.129.2
      • quay.io/deepgram/onprem-api:release-250130
      • quay.io/deepgram/onprem-api:1.129.2
  • quay.io/deepgram/self-hosted-engine:release-250130

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.79.2
      • quay.io/deepgram/onprem-engine:release-250130
      • quay.io/deepgram/onprem-engine:3.79.2
    • Minimum required NVIDIA driver version: >=530.30.02

    • Maximum supported NVIDIA driver version: <=561.00.00

  • quay.io/deepgram/self-hosted-license-proxy:release-250130

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.7.0
      • quay.io/deepgram/onprem-license-proxy:release-250130
      • quay.io/deepgram/onprem-license-proxy:1.7.0
  • quay.io/deepgram/self-hosted-billing:release-250130

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.10.0
      • quay.io/deepgram/onprem-billing:release-250130
      • quay.io/deepgram/onprem-billing:1.10.0

This Release Contains The Following Changes

  • Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next seven months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.
  • Keeps our software up-to-date.

Container Images Release

Container Images (release 241226)

  • quay.io/deepgram/self-hosted-api:release-241226

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.127.0
      • quay.io/deepgram/onprem-api:release-241226
      • quay.io/deepgram/onprem-api:1.127.0
  • quay.io/deepgram/self-hosted-engine:release-241226

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.77.4
      • quay.io/deepgram/onprem-engine:release-241226
      • quay.io/deepgram/onprem-engine:3.77.4
    • Minimum required NVIDIA driver version: >=530.30.02

    • Maximum supported NVIDIA driver version: <=561.00.00

  • quay.io/deepgram/self-hosted-license-proxy:release-241226

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.7.0
      • quay.io/deepgram/onprem-license-proxy:release-241226
      • quay.io/deepgram/onprem-license-proxy:1.7.0
  • quay.io/deepgram/self-hosted-billing:release-241226

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.10.0
      • quay.io/deepgram/onprem-billing:release-241226
      • quay.io/deepgram/onprem-billing:1.10.0

This Release Contains The Following Changes

  • Improves profanity filtering. No model update is required. Use the request parameter profanity_filter=true.
  • Increases robustness of handling TTS text inputs.
  • Adds new streaming latency metrics for monitoring latency in Grafana or similar applications.
  • Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next eight months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.
  • Keeps our software up-to-date.

Container Images Release

Container Images (release 241121)

  • quay.io/deepgram/self-hosted-api:release-241121

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.124.2
      • quay.io/deepgram/onprem-api:release-241121
      • quay.io/deepgram/onprem-api:1.124.2
  • quay.io/deepgram/self-hosted-engine:release-241121

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.76.1
      • quay.io/deepgram/onprem-engine:release-241121
      • quay.io/deepgram/onprem-engine:3.76.1
    • Minimum required NVIDIA driver version: >=525.60.13

  • quay.io/deepgram/self-hosted-license-proxy:release-241121

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.7.0
      • quay.io/deepgram/onprem-license-proxy:release-241121
      • quay.io/deepgram/onprem-license-proxy:1.7.0
  • quay.io/deepgram/self-hosted-billing:release-241121

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.10.0
      • quay.io/deepgram/onprem-billing:release-241121
      • quay.io/deepgram/onprem-billing:1.10.0

This Release Contains The Following Changes

  • Increases the maximum throughput for pre-recorded transcription of longer audio durations.
  • Improves beta streaming entity formatting.
  • Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next nine months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.
  • Keeps our software up-to-date.

Container Images Release

Container Images (release 241024)

  • quay.io/deepgram/self-hosted-api:release-241024

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.122.0
      • quay.io/deepgram/onprem-api:release-241024
      • quay.io/deepgram/onprem-api:1.122.0
  • quay.io/deepgram/self-hosted-engine:release-241024

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.75.5
      • quay.io/deepgram/onprem-engine:release-241024
      • quay.io/deepgram/onprem-engine:3.75.5
    • Minimum required NVIDIA driver version: >=525.60.13

  • quay.io/deepgram/self-hosted-license-proxy:release-241024

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.7.0
      • quay.io/deepgram/onprem-license-proxy:release-241024
      • quay.io/deepgram/onprem-license-proxy:1.7.0
  • quay.io/deepgram/self-hosted-billing:release-241024

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.10.0
      • quay.io/deepgram/onprem-billing:release-241024
      • quay.io/deepgram/onprem-billing:1.10.0

This Release Contains The Following Changes

  • Adds new streaming WebSocket TTS!

    • This is a software feature, so no new TTS models are required.
    • To enable, add the feature flag speak_streaming = true in the [features] section of your api.toml file.
    • See the help center guide for further details.
  • Improves beta streaming entity formatting.

  • Reminder: The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. For the next ten months, both onprem-* and self-hosted-* image repositories will receive identical image updates monthly, and we will announce image tags in the self-hosted repositories. Subsequently, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.

  • Keeps our software up-to-date.