Container Images Release

Deepgram Self-Hosted October 2025 Release (251015)

Container Images (release 251015)

  • quay.io/deepgram/self-hosted-api:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.166.0
  • quay.io/deepgram/self-hosted-engine:release-251015

    • Equivalent image to:

      • quay.io/deepgram/self-hosted-engine:3.104.7
    • Minimum required NVIDIA driver version: >=570.172.08

  • quay.io/deepgram/self-hosted-license-proxy:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.9.1
  • quay.io/deepgram/self-hosted-billing:release-251015

    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.12.1

This Release Contains The Following Changes

  • Adds support for Flux, our new turn-based streaming speech-to-text model! Flux is purpose-built for voice agent use-cases.

    • Because of its resource usage, Flux must be run in its own self-hosted deployment separate from any other Deepgram models, even other speech-to-text models like Nova-3.
    • Connect to Flux’s new /v2/listen endpoint using model=flux-general-en, such as: http://localhost:8080/v2/listen?model=flux-general-en. Do not pass an additional language parameter.
    • Flux is currently English-only.
    • Contact your Deepgram account representative for access to the Flux model.
    • Get started with Flux’s developer docs and API reference.
    • Learn more in Flux’s changelog entry, Announcement Blog, and try our Interactive Demo.
    • To enable Flux in your self-hosted deployment, make the following additions to your TOML files:
      • api.toml:
        [features]
        listen_v2 = true
      • engine.toml:
        [flux]
        enabled = true
  • General Improvements — Keeps our software up-to-date.