TTS Voice Controls
Adjust speaking speed and override pronunciation for specific words using Aura-2 controls.
This feature is currently in Early Access. To request access or leave feedback, contact your Account Executive or reach out to sales@deepgram.com.
Aura-2 Controls enable fine-grained adjustments to speech output, allowing you to modify speaking speed and override pronunciation for specific words. These controls are designed for enterprise use cases requiring precise voice quality for industry-specific terminology, brand names, and complex content.
During Early Access, Aura-2 Controls are available for English voices only via the REST API.
Speed control
Adjust the speaking rate of generated audio. Speed control modifies the pace of speech while maintaining natural prosody and voice quality.
Parameters
Example request
Speed values
The 0.7x–1.5x range maintains natural prosody with minimal disfluencies. Speeds outside this range would introduce artifacts that degrade user experience.
Pronunciation control
Override the default pronunciation of specific words using International Phonetic Alphabet (IPA) notation.
Syntax
Pronunciation overrides are specified inline within the text using escaped JSON objects:
Where:
wordis the original text (used for billing and display)pronounceis the IPA phonetic transcription- Curly braces must be escaped with backslashes (
\{and\})
Example request
The curly braces must be escaped with \\{ and \\} in the cURL command.
Common use cases
Validation rules
Combining controls
Speed and pronunciation controls can be used together in the same request.
Healthcare example
Use raw string (r'...') with escaped braces \{ and \} for pronunciation control in Python.
Brand consistency example
IPA reference
Vowels (American English)
Consonants
Stress markers
Billing
Example: Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond. is billed as Hello, Mr. Bond. (16 characters)
Response headers
Error handling
Speed out of range
Invalid pronunciation
Limits
Early access scope
Pause control and WebSocket support are planned for future releases.