For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Ask AIPlaygroundLoginFree API Key
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
    • Getting Started with Speech to Text
  • Pre-Recorded Audio
    • Getting Started
    • Feature Overview
    • Template Apps
  • Streaming Audio
      • Getting Started
      • Feature Overview
      • Live Streaming Starter Kit
      • Template Apps
    • Compare Flux to Nova-3
  • Models and Languages
    • Models & Languages Overview
    • Languages Support
    • Language Detection
    • Multilingual Codeswitching
    • Model Options
    • Version
  • Formatting
    • Speaker Diarization
    • Dictation
    • Filler Words
    • Measurements
    • Numerals
    • Paragraphs
    • Profanity Filtering
    • Punctuation
    • Redaction
    • Smart Formatting
    • Supported Entity Types
    • Utterances
    • Utterance Split
  • Custom Vocabulary
    • Find and Replace
    • Keyterm Prompting
    • Keywords
    • Search
  • Media Input Settings
    • Channels
    • Encoding
    • Multichannel
    • Sample Rate
  • Results Processing
    • Understanding Word Confidence Scores
    • STT Callback
    • STT Tagging
    • Extra Metadata
  • Migrating
    • Migrating From Amazon Web Services (AWS) Transcribe to Deepgram
    • Migrating From Google Speech-to-Text (STT) to Deepgram
    • Migrating From OpenAI Whisper to Deepgram
    • Migrating from AssemblyAI Speech-to-Text to Deepgram
LogoLogo
Ask AIPlaygroundLoginFree API Key
On this page
  • Model Selection
  • Formatting
  • Custom Vocabulary
  • Intelligence
  • Media Input Settings
  • Results Processing
  • Control Messages
  • Rate Limits
  • Deepgram Self-Hosted
Streaming AudioTranscription (Nova-3)

Feature Overview

Below is a matrix of Deepgram’s Speech-to-Text Streaming features. Please refer to the corresponding documentation for more details.

Was this page helpful?
Previous

Live Streaming Starter Kit

Deepgram's Live Streaming Starter Kit will take you step by step through the process of getting up and running with Deepgram's live streaming API.
Next
Built with

To learn how to get up and running with Streaming Speech-to-Text, read the Streaming Speech-to-Text getting started guide.

Model Selection

FeatureLanguage(s)
ModelAll available
LanguageAll available
Multilingual CodeswitchingSpecific languages only
VersionAll available

Formatting

FeatureLanguage(s)
Smart FormattingAll available
Speaker DiarizationAll available
NumeralsSpecific languages only
PunctuationAll available
Profanity FilterSpecific languages only
RedactionSpecific languages only

Custom Vocabulary

FeatureLanguage(s)
Find and ReplaceAll available
Keyterm Prompting (Also see Legacy Keywords)All available
SearchAll available

Intelligence

FeatureModel SupportLanguage(s)
Entity DetectionNova, Nova-2, Nova-3, EnhancedEnglish (all available regions)

Media Input Settings

FeatureLanguage(s)
MultichannelAll available
Sample rateAll available
ChannelsAll available
EncodingAll available

Results Processing

FeatureLanguage(s)
CallbackAll available
EndpointingAll available
Utterance EndAll available
Speech StartedAll available
Interim resultsAll available
TaggingAll available
Extra MetadataAll available

Control Messages

Feature
Close Stream
Finalize
Keep Alive

Rate Limits

For information on Deepgram’s Concurrency Rate Limits, refer to our API Rate Limits Documentation.

Deepgram Self-Hosted

Having challenges with performance and latency? Check out Deepgram’s Self-Hosted Solution to get the benefits of running your own hosted instance of Deepgram.