November 10, 2025

Entity Detection Now Available for Streaming Speech-to-Text

We’re excited to announce that Entity Detection is now available for streaming (real-time) speech-to-text with Nova 3, Nova 2, Nova, and Enhanced models.

detect_entities=true

Previously available only for pre-recorded audio, you can now identify and extract over 50 unique entity types in real-time streaming transcriptions, including email addresses, names, locations, phone numbers, social security numbers, and more.

Key features for streaming Entity Detection:

  • Model support: Nova 3, Nova 2, Nova, and Enhanced models only (not available for Base models or Flux)
  • Real-time detection: Entities are included in final results (is_final: true messages)
  • Enhanced formatting: Includes both value (formatted) and raw_value (original spoken text) fields when formatting is enabled
  • Automatic inclusion: Empty entities array returned when no entities are detected

Example usage:

1const connection = deepgram.listen.live({
2 model: "nova-3",
3 language: "en-US",
4 smart_format: true,
5 detect_entities: true,
6});

For detailed information, see our Entity Detection documentation and supported entity types.