TTS Voice Controls
Adjust speaking speed and override pronunciation for specific words using Aura-2 controls.
Aura-2 Controls enable fine-grained adjustments to speech output, allowing you to modify speaking speed and override pronunciation for specific words. These controls are designed for enterprise use cases requiring precise voice quality for industry-specific terminology, brand names, and complex content.
Availability
Speed control
Adjust the speaking rate of generated audio. Speed control modifies the pace of speech while maintaining natural prosody and voice quality.
Parameters
For Spanish voices, the recommended speed range is 0.9 - 1.5. Values below 0.9 may introduce disfluencies.
Example request
Speed values
Speed values outside the 0.7x–1.5x range will return an error.
Pronunciation control
Override the default pronunciation of specific words using International Phonetic Alphabet (IPA) notation.
Syntax
Pronunciation overrides are specified inline within the text using escaped JSON objects:
Where:
wordis the original text (used for billing and display)pronounceis the IPA phonetic transcription- Curly braces must be escaped with backslashes (
\{and\})
Example request
The curly braces must be escaped with \\{ and \\} in the cURL command.
Common use cases
Sourcing IPA transcriptions
A few rules of thumb for producing IPA for your own vocabulary:
- Short lists (<20 words): generate with an LLM and validate by ear.
- Longer lists: use authoritative dictionaries that publish IPA directly:
Best practices:
- Always validate by ear. IPA that looks correct on the page can still sound off when synthesized — listen to the output before shipping.
- Match the dialect. UK and US pronunciations differ (e.g., schedule, aluminum). Make sure the IPA you choose matches the voice and audience you’re targeting.
Validation rules
Combining controls
Speed and pronunciation controls can be used together in the same request.
Healthcare example
Use raw string (r'...') with escaped braces \{ and \} for pronunciation control in Python.
Brand consistency example
IPA reference
Vowels (American English)
Consonants
Stress markers
Billing
Example: Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond. is billed as Hello, Mr. Bond. (16 characters)