For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Ask AIPlaygroundLoginFree API Key
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
    • Home
    • Ask AI
    • Support
    • Changelog
  • Trust & Security
    • Security Policy
    • Data Privacy Compliance
    • Information Security & Privacy
  • SDKs
    • SDK Features
  • Guides
LogoLogo
Ask AIPlaygroundLoginFree API Key

Changelog

April 13, 2023
April 13, 2023
Was this page helpful?
Previous

March 15, 2023

Next
Built with

Latest Speech Recognition Model Releases

We are pleased to announce the latest model releases to our speech recognition services.

Deepgram Nova

Deepgram Nova presents the new state-of-the-art in speech recognition. Read more about it in our announcement.

Nova is available with our general and phonecall models. To access either, please use the following syntax in your request:

  • General: model=nova or model=general&tier=nova
  • Phonecall: model=phonecall&tier=nova

Support for Deepgram Nova includes:

  • English language support.
  • Pre-recorded and live streaming audio transcription.
  • Use through Deepgram’s Hosted API or On-Prem Deployments.

Please view pricing at deepgram.com/pricing.

Deepgram Whisper Cloud and Whisper On-Prem

Deepgram Whisper Cloud and Whisper On-Prem integrate OpenAI’s Whisper models with Deepgram’s powerful API and feature set.

Deepgram Whisper Cloud and Whisper On-Prem can be accessed with the following API parameters:

  • model=whisper or model=whisper-SIZE

  • Available sizes include:

    • whisper-tiny
    • whisper-base
    • whisper-small
    • whisper-medium (default)
    • whisper-large (defaults to OpenAI’s large-v2)
  • *Note: You should not specify a *tier when using Whisper models.

Use of Deepgram Whisper Cloud is subject to a rate limit of 50 requests per minute or 15 concurrent requests.

Support for Deepgram Whisper Cloud and Whisper On-Prem include:

  • A selection of Deepgram’s transcription features, including:

    • Diarization

    • Word-level time stamps

    • Language detection

    • Redaction

    • Diarization

    • Smart Formatting

      • Punctuation, Numeral Formatting, Find and Replace, Paragraphs, Utterances
    • Multichannel Support

    • Callback Support

    • Summarization

    • Topic Detection

  • OpenAI’s list of supported languages.

  • Pre-recorded transcription.

  • Use through Deepgram’s Hosted API or On-Prem Deployments.

Please view pricing at deepgram.com/pricing.

To learn more about the various parameters you can use to customize your transcriptions with Deepgram, check out the list of Deepgram’s features in our documentation.