October 15, 2025 | Deepgram's Docs

Container Images Release

quay.io/deepgram/self-hosted-api:release-251015
- Equivalent image to:
  - quay.io/deepgram/self-hosted-api:1.166.0
quay.io/deepgram/self-hosted-engine:release-251015
- Equivalent image to:
  - quay.io/deepgram/self-hosted-engine:3.104.7
- Minimum required NVIDIA driver version: >=570.172.08
quay.io/deepgram/self-hosted-license-proxy:release-251015
- Equivalent image to:
  - quay.io/deepgram/self-hosted-license-proxy:1.9.1
quay.io/deepgram/self-hosted-billing:release-251015
- Equivalent image to:
  - quay.io/deepgram/self-hosted-billing:1.12.1

Adds support for Flux, our new turn-based streaming speech-to-text model! Flux is purpose-built for voice agent use-cases.
- Because of its resource usage, Flux must be run in its own self-hosted deployment separate from any other Deepgram models, even other speech-to-text models like Nova-3.
- Connect to Flux’s new /v2/listen endpoint using model=flux-general-en, such as: http://localhost:8080/v2/listen?model=flux-general-en. Do not pass an additional language parameter.
- Flux is currently English-only.
- Contact your Deepgram account representative for access to the Flux model.
- Get started with Flux’s developer docs and API reference.
- Learn more in Flux’s changelog entry, Announcement Blog, and try our Interactive Demo.
- To enable Flux in your self-hosted deployment, make the following additions to your TOML files:
  - api.toml:
```
[features]
listen_v2 = true
```
  - engine.toml:
```
[flux]
enabled = true
```
Resolves a Nova-3 diarization issue present in the prior release (release-250929).
General Improvements — Keeps our software up-to-date.