On-Premises Release 221031
Deepgram released a new version of its on-premises solution.
On-Premises Release 221031: Docker Hub Images
-
deepgram/onprem-api:1.72.2 -
deepgram/onprem-engine:3.37.8- Minimum required NVIDIA driver version:
>=450.80.02
- Minimum required NVIDIA driver version:
-
deepgram/onprem-license-proxy:1.2.2 -
deepgram/onprem-billing:1.4.0 -
deepgram/onprem-metrics-server:2.0.0
Changes
-
Deepgram On-premises users can now choose between DeepgramāsĀ BaseĀ andĀ EnhancedĀ models in an ASR request via theĀ
tierĀ query parameter, whereĀtier=baseĀ will select theĀ *BaseĀ *model andĀtier=enhancedĀ will select theĀ EnhancedĀ model.tierĀ works in conjunction with theĀdetect_languageĀ query parameter.- For users whose Enhanced models do not include the ā*-enhancedā suffix in the model name, the use of theĀ
tierĀ parameter is required. - Models may still be invoked directly via the model UUID without the use ofĀ
tier.
-
Deepgram On-premises deployments now support the following Understanding features (with the accompanying Understanding model deployed on-prem and the requisite configuration changes):
-
Topic Detection enables users to detect the most important and relevant topics that are referenced in speech within the audio.
detect_topics=true&punctuate=true- This requires the addition of the following section to theĀ
api.tomlĀ file:[features] topic_detection = true
- This requires the addition of the following section to theĀ
-
When you use these Understanding features, please note that theĀ punctuate=trueĀ parameter is required as part of the ASR request. If you do not explicitly include this parameter, it will be implicitly included by the system.
-
-
Deepgram On-premises now supports the all-new āCloseStreamā web socket message for closing your live audio streams. Please see theĀ New Methods for Closing StreamsĀ changelog post for more information, or refer to the API documentation forĀ Transcribing Live Streaming Audio.