October 31, 2022

On-Premises Release 221031

Deepgram released a new version of its on-premises solution.

On-Premises Release 221031: Docker Hub Images

  • deepgram/onprem-api:1.72.2

  • deepgram/onprem-engine:3.37.8

    • Minimum required NVIDIA driver version: >=450.80.02
  • deepgram/onprem-license-proxy:1.2.2

  • deepgram/onprem-billing:1.4.0

  • deepgram/onprem-metrics-server:2.0.0

Changes

  • Deepgram On-premises users can now choose between Deepgram’sĀ BaseĀ andĀ EnhancedĀ models in an ASR request via theĀ tierĀ query parameter, whereĀ tier=baseĀ will select theĀ *BaseĀ *model andĀ tier=enhancedĀ will select theĀ EnhancedĀ model.

    • tierĀ works in conjunction with theĀ detect_languageĀ query parameter.
    • For users whose Enhanced models do not include the ā€œ*-enhancedā€ suffix in the model name, the use of theĀ tierĀ parameter is required.
    • Models may still be invoked directly via the model UUID without the use ofĀ tier.
  • Deepgram On-premises deployments now support the following Understanding features (with the accompanying Understanding model deployed on-prem and the requisite configuration changes):

    • Topic Detection enables users to detect the most important and relevant topics that are referenced in speech within the audio. detect_topics=true&punctuate=true

      • This requires the addition of the following section to theĀ api.tomlĀ file:[features] topic_detection = true
    • When you use these Understanding features, please note that theĀ punctuate=trueĀ parameter is required as part of the ASR request. If you do not explicitly include this parameter, it will be implicitly included by the system.

  • Deepgram On-premises now supports the all-new ā€œCloseStreamā€ web socket message for closing your live audio streams. Please see theĀ New Methods for Closing StreamsĀ changelog post for more information, or refer to the API documentation forĀ Transcribing Live Streaming Audio.