quay.io/deepgram/self-hosted-api:release-250828
quay.io/deepgram/self-hosted-api:1.156.1quay.io/deepgram/onprem-api:release-250828quay.io/deepgram/onprem-api:1.156.1quay.io/deepgram/self-hosted-engine:release-250828
Equivalent image to:
quay.io/deepgram/self-hosted-engine:3.100.0quay.io/deepgram/onprem-engine:release-250828quay.io/deepgram/onprem-engine:3.100.0Minimum required NVIDIA driver version: >=570.172.08
quay.io/deepgram/self-hosted-license-proxy:release-250828
quay.io/deepgram/self-hosted-license-proxy:1.8.0quay.io/deepgram/onprem-license-proxy:release-250828quay.io/deepgram/onprem-license-proxy:1.8.0quay.io/deepgram/self-hosted-billing:release-250828
quay.io/deepgram/self-hosted-billing:1.11.2quay.io/deepgram/onprem-billing:release-250828quay.io/deepgram/onprem-billing:1.11.2⚠️ Please be advised that this release is known to contain an Aura-2 TTS defect where the wrong conditioning priors are applied, resulting in the insertion of a random utterance. While the issue is rare, we strongly advise all Aura-2 TTS customers to update to release-250929 as soon as possible, where this behavior was resolved by a code change.
[health] section in engine.toml with gpu_required configuration option allows Engine to fail on startup if no GPU is detected. While Engine can run without a GPU, production deployments require one for acceptable performance. Set gpu_required = true to fail fast if no GPU is available, rather than running with severely degraded performance. Default: false.Important: This is the last release that will include onprem-* image tags. The Deepgram image repositories have been updated to reflect our “self-hosted” naming. Images should now be pulled from the self-hosted-* Quay repositories. Starting with the next release in September 2025, we will only publish new images to self-hosted-* repos, deprecating onprem-* repository variants.