Feature Overview

The tables below list Deepgram’s speech-to-text features for pre-recorded audio and the languages each feature supports. Refer to each feature’s documentation for details.

Model selection

| Feature | Language(s) |
| --- | --- |
| Model | All available |
| Language | All available |
| Language Detection | All not in beta |
| Multilingual Codeswitching | Specific languages only |
| Version | All available |

Formatting

| Feature | Language(s) |
| --- | --- |
| Smart Formatting | All available |
| Speaker Diarization | All available |
| Filler words | English (all available regions) |
| Numerals | Specific languages only |
| Punctuation | All available |
| Paragraphs | All in which words are delimited by spaces |
| Profanity Filter | Specific languages only |
| Redaction | Specific languages only |
| Utterances | All available |
| Utterance Split | All available |
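
As a rough illustration of how formatting features are typically switched on for a pre-recorded request, the sketch below assembles a request URL with boolean query parameters. The endpoint `https://api.deepgram.com/v1/listen` and the parameter names (`smart_format`, `diarize`, `filler_words`) are assumptions not stated on this page, and `build_listen_url` is a hypothetical helper; check the API Reference before relying on any of them.

```python
from urllib.parse import urlencode

# Assumed endpoint; not stated on this page. Verify against the API Reference.
LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(language: str, **features: bool) -> str:
    """Build a request URL, including only the features set to True.

    Feature names are passed through as query parameter names, so they
    must match whatever the API actually expects (assumed here).
    """
    params = {"language": language}
    # Sort for a deterministic parameter order.
    for name, enabled in sorted(features.items()):
        if enabled:
            params[name] = "true"
    return f"{LISTEN_URL}?{urlencode(params)}"


# Hypothetical usage: English audio with smart formatting and diarization.
url = build_listen_url("en", smart_format=True, diarize=True, filler_words=False)
print(url)
```

Features left as `False` are omitted from the query string, so the request falls back to the API's defaults for them.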

Custom vocabulary

| Feature | Language(s) |
| --- | --- |
| Find and Replace | All available |
| Keyterm Prompting (Also see Legacy Keywords) | All available |
| Search | All available |

Media input settings

| Feature | Language(s) |
| --- | --- |
| Channels | All available |
| Encoding | All available |
| Multichannel | All available |
| Sample Rate | All available |

Result processing

| Feature | Language(s) |
| --- | --- |
| Callback | All available |
| Tagging | All available |
| Extra Metadata | All available |

Intelligence

| Feature | Language(s) |
| --- | --- |
| Sentiment Analysis | English (all available regions) |
| Intent Recognition | English (all available regions) |
| Topic Detection | English (all available regions) |
| Summarization | English (all available regions) |
| Entity Detection | English (all available regions) |
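
Because several features in this matrix are listed as English only, a client may want to check a request before sending it. The following is a minimal sketch of such a check, assuming language tags look like `en`, `en-US`, or `es` (the tag format is an assumption, not something this page specifies). The feature names are taken from the tables on this page; `unsupported_features` is a hypothetical helper, not part of any Deepgram SDK.

```python
# Features this page lists as "English (all available regions)".
ENGLISH_ONLY_FEATURES = {
    "Filler words",
    "Sentiment Analysis",
    "Intent Recognition",
    "Topic Detection",
    "Summarization",
    "Entity Detection",
}


def is_english(language_tag: str) -> bool:
    """Return True for tags whose primary subtag is 'en' (e.g. 'en', 'en-GB')."""
    return language_tag.lower().split("-")[0] == "en"


def unsupported_features(language_tag: str, requested: list[str]) -> list[str]:
    """List requested features that are English only when the language is not English."""
    if is_english(language_tag):
        return []
    return sorted(f for f in requested if f in ENGLISH_ONLY_FEATURES)


# Hypothetical usage: Spanish audio with one English-only feature requested.
print(unsupported_features("es", ["Summarization", "Punctuation"]))
```

Features marked "Specific languages only" are not covered here, since this page does not enumerate those languages; consult each feature's documentation for them.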

Rate Limits

For information on Deepgram’s Concurrency Rate Limits, refer to our API Rate Limits Documentation.

Deepgram Self-Hosted

Having challenges with performance and latency? Check out Deepgram’s Self-Hosted Solution to get the benefits of running your own hosted instance of Deepgram.