Tier

Learn about Deepgram's Tier feature, which allows you to associate your API requests with a specific tier.

Deepgram's tier feature allows you to associate your API requests with a specific tier architecture, which indicates the level of model you would like to use in your request.

Tiers & Options

Below is a list of model tiers (tier), each of which has its own list of model options (model).

ℹ️

To learn more about models, see Models. To learn more about pricing, see Deepgram Pricing & Plans.

For self-serve customers, Deepgram provides Nova, Enhanced, and Base tiers along with each tier's model options. To learn more about pricing, see Deepgram Pricing & Plans.

Nova

Nova model tiers are our newest and most powerful speech-to-text models on the market today. Deepgram's Nova models have the following options:

  • general: Optimized for everyday audio processing. Likely to be more accurate than any region-specific Base model for the language for which it is enabled. If you aren't sure which model to select, start here.

  • phonecall: Optimized for low-bandwidth audio phone calls.

The Nova tier and its model options can be called with the following syntax:

https://api.deepgram.com/v1/listen?tier=nova
https://api.deepgram.com/v1/listen?tier=nova&model=OPTION

Read the Nova Quickstart to learn more about getting started with Nova.

Enhanced

Enhanced model tiers are still some of our most powerful ASR models; they generally have higher accuracy and better word recognition than our base models, and they handle uncommon words significantly better. Deepgram's Enhanced models have the following options:

  • general: Optimized for everyday audio processing. Likely to be more accurate than any region-specific Base model for the language for which it is enabled. If you aren't sure which model to select, start here.

  • meeting beta: Optimized for conference room settings, which include multiple speakers with a single microphone.

  • phonecall: Optimized for low-bandwidth audio phone calls.

  • finance beta: Optimized for multiple speakers with varying audio quality, such as might be found on a typical earnings call. Vocabulary is heavily finance oriented.

The Enhanced tier and its model options can be called with the following syntax:

https://api.deepgram.com/v1/listen?tier=enhanced
https://api.deepgram.com/v1/listen?tier=enhanced&model=OPTION

Base

Base model tiers are built on our signature end-to-end deep learning speech model architecture. They offer a solid combination of accuracy and cost effectiveness in some cases. Deepgram's Base models have the following options:

  • general: (Default) Optimized for everyday audio processing.
  • meeting: Optimized for conference room settings, which include multiple speakers with a single microphone.
  • phonecall: Optimized for low-bandwidth audio phone calls.
  • voicemail: Optimized for low-bandwidth audio clips with a single speaker. Derived from the phonecall model.
  • finance: Optimized for multiple speakers with varying audio quality, such as might be found on a typical earnings call. Vocabulary is heavily finance oriented.
  • conversationalai: Optimized to allow artificial intelligence technologies, such as chatbots, to interact with people in a human-like way.
  • video: Optimized for audio sourced from videos.

The Base tier and its model options can be called with the following syntax:

https://api.deepgram.com/v1/listen?tier=base
https://api.deepgram.com/v1/listen?tier=base&model=OPTION

⚠️

You should only submit model options that are available for a particular tier; submitting a model option that is not available for a tier will result in a parse error.

Try it out

To transcribe audio from a file on your computer using a particular tier, run the following curl command in a terminal or your favorite API client.

ℹ️

Be sure to replace the placeholder OPTION with your chosen tier and YOUR_DEEPGRAM_API_KEY with your Deepgram API Key. You can create an API Key in the Deepgram Console.

curl \
  --request POST \
  --header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \
  --data-binary @youraudio.wav \
  --url 'https://api.deepgram.com/v1/listen?tier=OPTION'