Entity Detection Now Available for Streaming Speech-to-Text
We’re excited to announce that Entity Detection is now available for streaming (real-time) speech-to-text with Nova 3, Nova 2, Nova, and Enhanced models.
detect_entities=true
Previously available only for pre-recorded audio, you can now identify and extract over 50 unique entity types in real-time streaming transcriptions, including email addresses, names, locations, phone numbers, social security numbers, and more.
Key features for streaming Entity Detection:
- Model support: Nova 3, Nova 2, Nova, and Enhanced models only (not available for Base models or Flux)
- Real-time detection: Entities are included in final results (
is_final: truemessages) - Enhanced formatting: Includes both
value(formatted) andraw_value(original spoken text) fields when formatting is enabled - Automatic inclusion: Empty
entitiesarray returned when no entities are detected
Example usage:
For detailed information, see our Entity Detection documentation and supported entity types.