November 10, 2025 | Deepgram's Docs

Entity Detection Now Available for Streaming Speech-to-Text

We’re excited to announce that Entity Detection is now available for streaming (real-time) speech-to-text with Nova 3, Nova 2, Nova, and Enhanced models.

detect_entities=true

Previously available only for pre-recorded audio, you can now identify and extract over 50 unique entity types in real-time streaming transcriptions, including email addresses, names, locations, phone numbers, social security numbers, and more.

Key features for streaming Entity Detection:

Model support: Nova 3, Nova 2, Nova, and Enhanced models only (not available for Base models or Flux)
Real-time detection: Entities are included in final results (is_final: true messages)
Enhanced formatting: Includes both value (formatted) and raw_value (original spoken text) fields when formatting is enabled
Automatic inclusion: Empty entities array returned when no entities are detected

Example usage:

1 const connection = deepgram.listen.live({
2   model: "nova-3",
3   language: "en-US",
4   smart_format: true,
5   detect_entities: true,
6 });

For detailed information, see our Entity Detection documentation and supported entity types.