TTS Models
An overview of Text-to-Speech providers and models you can use with the Voice Agent API.
The Voice Agent API uses Deepgram Text-to-Speech by default. To use another provider’s TTS model with your Agent, apply the settings described below.
You can set your Text-to-Speech model in the Settings Message for your Voice Agent. See the docs for more information.
Deepgram TTS models
For a complete list of Deepgram TTS models, see TTS Voice Selection.
Example
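The following is a minimal sketch of a Settings message that selects a Deepgram TTS model, written as a Python dictionary that serializes to JSON. The helper name `deepgram_speak_settings` is hypothetical, and the field layout (`agent.speak.provider` with `type` and `model`) is an assumption that should be checked against the Settings reference. The default model name is only an illustration; pick one from TTS Voice Selection.

```python
import json


def deepgram_speak_settings(model: str = "aura-2-thalia-en") -> dict:
    """Build a Settings message that selects a Deepgram TTS model.

    The field names below are assumptions, not a confirmed schema.
    """
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "deepgram",  # assumed provider identifier
                    "model": model,      # a model from TTS Voice Selection
                }
            }
        },
    }


settings = deepgram_speak_settings()
# Serialize to the JSON text that would be sent to the Voice Agent.
print(json.dumps(settings, indent=2))
```

Only the `speak` portion is shown here; a complete Settings message carries other agent configuration as well.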
Third-Party TTS models
To use a third-party TTS voice, specify the TTS provider and its required parameters.
OpenAI
For OpenAI, refer to this article to find your voice ID.
Example
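A minimal sketch of what an OpenAI configuration might look like, assuming the provider is identified as `open_ai` and that the provider's URL and credentials are supplied in an `endpoint` object. Those field names, and the helper `openai_speak_settings`, are assumptions for illustration. The `tts-1` model, the `alloy` voice, and the speech endpoint URL are OpenAI's own names.

```python
import json


def openai_speak_settings(api_key: str, voice: str = "alloy",
                          model: str = "tts-1") -> dict:
    """Build a Settings message that selects an OpenAI TTS voice.

    The Settings field names are assumptions, not a confirmed schema.
    """
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "open_ai",  # assumed provider identifier
                    "model": model,
                    "voice": voice,     # the OpenAI voice ID
                },
                "endpoint": {
                    "url": "https://api.openai.com/v1/audio/speech",
                    "headers": {"authorization": f"Bearer {api_key}"},
                },
            }
        },
    }


settings = openai_speak_settings(api_key="YOUR_OPENAI_API_KEY")
print(json.dumps(settings, indent=2))
```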
ElevenLabs
For ElevenLabs, refer to this article to find your voice ID, or use their API to retrieve it. We support any of ElevenLabs’ Turbo 2.5 voices to ensure low-latency interactions.
Example
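A minimal sketch of an ElevenLabs configuration, assuming the provider is identified as `eleven_labs`, the model is passed as `model_id`, and the voice ID is embedded in the endpoint URL. These Settings field names and the helper `elevenlabs_speak_settings` are assumptions for illustration. `eleven_turbo_v2_5` is the ElevenLabs model ID for Turbo 2.5, and `xi-api-key` is the ElevenLabs API key header.

```python
import json


def elevenlabs_speak_settings(api_key: str, voice_id: str) -> dict:
    """Build a Settings message that selects an ElevenLabs Turbo 2.5 voice.

    The Settings field names are assumptions, not a confirmed schema.
    """
    url = (
        "wss://api.elevenlabs.io/v1/text-to-speech/"
        f"{voice_id}/multi-stream-input"
    )
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "eleven_labs",            # assumed identifier
                    "model_id": "eleven_turbo_v2_5",  # Turbo 2.5
                },
                "endpoint": {
                    "url": url,
                    "headers": {"xi-api-key": api_key},
                },
            }
        },
    }


settings = elevenlabs_speak_settings(
    api_key="YOUR_ELEVENLABS_API_KEY",
    voice_id="YOUR_VOICE_ID",
)
print(json.dumps(settings, indent=2))
```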
Cartesia
For Cartesia, use their API to retrieve a voice ID.
Example
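A minimal sketch of a Cartesia configuration, assuming the provider is identified as `cartesia` and that the voice is passed as an object with `mode` and `id`. These Settings field names, the helper `cartesia_speak_settings`, and the default model and language values are assumptions for illustration; confirm them against the Settings reference and Cartesia's API documentation.

```python
import json


def cartesia_speak_settings(api_key: str, voice_id: str,
                            model_id: str = "sonic-2",
                            language: str = "en") -> dict:
    """Build a Settings message that selects a Cartesia voice by ID.

    The Settings field names are assumptions, not a confirmed schema.
    """
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "cartesia",  # assumed provider identifier
                    "model_id": model_id,
                    "voice": {"mode": "id", "id": voice_id},
                    "language": language,
                },
                "endpoint": {
                    "url": "https://api.cartesia.ai/tts/bytes",
                    "headers": {"x-api-key": api_key},
                },
            }
        },
    }


settings = cartesia_speak_settings(
    api_key="YOUR_CARTESIA_API_KEY",
    voice_id="YOUR_VOICE_ID",
)
print(json.dumps(settings, indent=2))
```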
AWS Polly
For AWS Polly, refer to this article for a list of available voices.
If no engine is specified, AWS Polly defaults to Standard. If the chosen voice doesn’t support Standard, you’ll get an error like: “Standard engine not supported for {voice}.” In that case, you must explicitly specify the correct engine.
STS Example
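A minimal sketch of an AWS Polly configuration that uses temporary STS credentials and always sets the engine explicitly, which avoids the Standard default described above. The Settings field names (`aws_polly`, `credentials`, `session_token`, and so on) and the helper `polly_sts_speak_settings` are assumptions for illustration. The four engine names are the ones AWS Polly itself defines.

```python
import json

# Engine names defined by AWS Polly.
POLLY_ENGINES = {"standard", "neural", "long-form", "generative"}


def polly_sts_speak_settings(voice: str, engine: str, region: str,
                             access_key_id: str, secret_access_key: str,
                             session_token: str,
                             language_code: str = "en-US") -> dict:
    """Build a Settings message for AWS Polly with STS credentials.

    The Settings field names are assumptions, not a confirmed schema.
    """
    if engine not in POLLY_ENGINES:
        raise ValueError(f"unknown Polly engine: {engine!r}")
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "aws_polly",  # assumed provider identifier
                    "voice": voice,
                    "language_code": language_code,
                    "engine": engine,     # set explicitly, never defaulted
                    "credentials": {
                        "type": "sts",    # temporary credentials
                        "region": region,
                        "access_key_id": access_key_id,
                        "secret_access_key": secret_access_key,
                        "session_token": session_token,
                    },
                },
                "endpoint": {
                    "url": f"https://polly.{region}.amazonaws.com/v1/speech",
                },
            }
        },
    }


settings = polly_sts_speak_settings(
    voice="Matthew", engine="neural", region="us-west-2",
    access_key_id="YOUR_ACCESS_KEY_ID",
    secret_access_key="YOUR_SECRET_ACCESS_KEY",
    session_token="YOUR_SESSION_TOKEN",
)
print(json.dumps(settings, indent=2))
```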
IAM Example
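A minimal sketch of the same AWS Polly configuration with long-lived IAM credentials, which differ from the STS case in that no session token is supplied. As before, the Settings field names and the helper `polly_iam_speak_settings` are assumptions for illustration, not a confirmed schema.

```python
import json


def polly_iam_speak_settings(voice: str, engine: str, region: str,
                             access_key_id: str,
                             secret_access_key: str,
                             language_code: str = "en-US") -> dict:
    """Build a Settings message for AWS Polly with IAM credentials.

    The Settings field names are assumptions, not a confirmed schema.
    """
    return {
        "type": "Settings",
        "agent": {
            "speak": {
                "provider": {
                    "type": "aws_polly",  # assumed provider identifier
                    "voice": voice,
                    "language_code": language_code,
                    "engine": engine,     # must be one the voice supports
                    "credentials": {
                        "type": "iam",    # long-lived access key pair
                        "region": region,
                        "access_key_id": access_key_id,
                        "secret_access_key": secret_access_key,
                    },
                },
                "endpoint": {
                    "url": f"https://polly.{region}.amazonaws.com/v1/speech",
                },
            }
        },
    }


settings = polly_iam_speak_settings(
    voice="Matthew", engine="standard", region="us-east-1",
    access_key_id="YOUR_ACCESS_KEY_ID",
    secret_access_key="YOUR_SECRET_ACCESS_KEY",
)
print(json.dumps(settings, indent=2))
```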