TTS Models

An overview of Text-to-Speech providers and models you can use with the Voice Agent API.

By default, the Voice Agent API uses Deepgram Text-to-Speech. To use another provider’s TTS model with your Agent, apply the settings described below.

You can set your Text-to-Speech model in the Settings Message for your Voice Agent. See the docs for more information.

Deepgram TTS models

These are some of our most popular TTS voices. For a complete list, see the TTS Models Documentation.

| agent.speak.provider.model | Expressed Gender |
| --- | --- |
| aura-2-thalia-en | Feminine |
| aura-2-andromeda-en | Feminine |
| aura-2-helena-en | Feminine |
| aura-2-apollo-en | Masculine |
| aura-2-arcas-en | Masculine |
| aura-2-aries-en | Masculine |

Example

```json
{
  "speak": {
    "provider": {
      "type": "deepgram",
      "model": "aura-2-thalia-en"
    }
  }
}
```
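The parameter paths on this page (for example `agent.speak.provider.model`) suggest that the `speak` object sits under `agent` in the Settings message. The Python sketch below shows that nesting. The top-level `"type": "Settings"` field and the helper name `deepgram_settings` are assumptions that this page does not state, so check them against the Settings message reference.

```python
import json


def deepgram_settings(model: str = "aura-2-thalia-en") -> dict:
    """Build a Settings-style message that selects a Deepgram TTS voice."""
    return {
        "type": "Settings",  # assumed message type; not stated on this page
        "agent": {
            "speak": {
                "provider": {
                    "type": "deepgram",
                    "model": model,
                }
            }
        },
    }


if __name__ == "__main__":
    # Serialize to the JSON text that would be sent to the Voice Agent.
    print(json.dumps(deepgram_settings("aura-2-apollo-en"), indent=2))
```

Swapping the voice is then a one-argument change, using any model name from the table above.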

Third-Party TTS models

To use a third-party TTS voice, specify the TTS provider and its required parameters.

OpenAI

For OpenAI you can refer to this article on how to find your voice ID.

| Parameter | Type | Description |
| --- | --- | --- |
| agent.speak.provider.type | String | Must be open_ai |
| agent.speak.provider.model | String | The TTS model to use |
| agent.speak.provider.endpoint | Object | Required and must include url and headers |
| agent.speak.provider.endpoint.url | String | Your OpenAI API endpoint URL |
| agent.speak.provider.endpoint.headers | Object | Required headers for authentication |
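As a rough illustration of the OpenAI parameters above, the following Python sketch assembles a `speak` object. The helper name `openai_speak`, the model value `tts-1`, and the URL are illustrative assumptions. Placing `endpoint` beside `provider` follows the Example Payload on this page rather than the parameter paths in the table, so confirm the layout against the Settings message reference.

```python
import json


def openai_speak(model: str, url: str, api_key: str) -> dict:
    """Assemble a `speak` object for OpenAI TTS (hypothetical helper)."""
    return {
        "provider": {"type": "open_ai", "model": model},
        # Assumption: `endpoint` sits beside `provider`, as in the Example Payload.
        "endpoint": {
            "url": url,
            "headers": {"authorization": f"Bearer {api_key}"},
        },
    }


if __name__ == "__main__":
    # Placeholder values only; substitute your own model, URL, and key.
    speak = openai_speak("tts-1", "https://api.example.com/tts", "YOUR_OPENAI_API_KEY")
    print(json.dumps({"speak": speak}, indent=2))
```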

ElevenLabs

For ElevenLabs you can refer to this article on how to find your Voice ID or use their API to retrieve it. We support any of ElevenLabs’ Turbo 2.5 voices to ensure low-latency interactions.

| Parameter | Type | Description |
| --- | --- | --- |
| agent.speak.provider.type | String | Must be eleven_labs |
| agent.speak.provider.model_id | String | The model ID to use |
| agent.speak.provider.language_code | String | Optional language code |
| agent.speak.provider.endpoint | Object | Required and must include url and headers |
| agent.speak.provider.endpoint.url | String | Your ElevenLabs API endpoint URL |
| agent.speak.provider.endpoint.headers | Object | Required headers for authentication |
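The ElevenLabs parameters differ from OpenAI mainly in using `model_id` and an optional `language_code`. A minimal Python sketch, under stated assumptions: the helper name `elevenlabs_speak` is hypothetical, the model ID `eleven_turbo_v2_5` and the `xi-api-key` header are ElevenLabs' own naming assumed to apply here, and `endpoint` is placed beside `provider` as in the Example Payload on this page.

```python
import json
from typing import Optional


def elevenlabs_speak(model_id: str, url: str, headers: dict,
                     language_code: Optional[str] = None) -> dict:
    """Assemble a `speak` object for ElevenLabs TTS (hypothetical helper)."""
    provider = {"type": "eleven_labs", "model_id": model_id}
    if language_code is not None:
        # language_code is optional, so it is only emitted when supplied.
        provider["language_code"] = language_code
    # Assumption: `endpoint` sits beside `provider`, as in the Example Payload.
    return {"provider": provider, "endpoint": {"url": url, "headers": dict(headers)}}


if __name__ == "__main__":
    speak = elevenlabs_speak(
        "eleven_turbo_v2_5",                        # assumed Turbo 2.5 model ID
        "https://api.example.com/tts",              # placeholder URL
        {"xi-api-key": "YOUR_ELEVENLABS_API_KEY"},  # assumed auth header name
        language_code="en-US",
    )
    print(json.dumps({"speak": speak}, indent=2))
```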

Cartesia

For Cartesia you can use their API to retrieve a voice ID.

| Parameter | Type | Description |
| --- | --- | --- |
| agent.speak.provider.type | String | Must be cartesia |
| agent.speak.provider.model_id | String | The model ID to use |
| agent.speak.provider.voice | Object | Cartesia voice configuration |
| agent.speak.provider.voice.mode | String | The voice mode to use |
| agent.speak.provider.voice.id | String | The voice ID to use |
| agent.speak.provider.language | String | Optional language setting |
| agent.speak.provider.endpoint | Object | Required and must include url and headers |
| agent.speak.provider.endpoint.url | String | Your Cartesia API endpoint URL |
| agent.speak.provider.endpoint.headers | Object | Required headers for authentication |
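Cartesia is the only provider listed here with a nested `voice` object. The Python sketch below shows one way to assemble it. The helper name `cartesia_speak`, the default voice mode `"id"`, the model ID `sonic-2`, and the `X-API-Key` header are assumptions taken from Cartesia's general conventions, not from this page, and `endpoint` is placed beside `provider` as in the Example Payload.

```python
import json
from typing import Optional


def cartesia_speak(model_id: str, voice_id: str, url: str, headers: dict,
                   voice_mode: str = "id",
                   language: Optional[str] = None) -> dict:
    """Assemble a `speak` object for Cartesia TTS (hypothetical helper)."""
    provider = {
        "type": "cartesia",
        "model_id": model_id,
        "voice": {"mode": voice_mode, "id": voice_id},
    }
    if language is not None:
        # language is optional, so it is only emitted when supplied.
        provider["language"] = language
    # Assumption: `endpoint` sits beside `provider`, as in the Example Payload.
    return {"provider": provider, "endpoint": {"url": url, "headers": dict(headers)}}


if __name__ == "__main__":
    speak = cartesia_speak(
        "sonic-2",                                # assumed model ID
        "YOUR_CARTESIA_VOICE_ID",                 # placeholder voice ID
        "https://api.example.com/tts",            # placeholder URL
        {"X-API-Key": "YOUR_CARTESIA_API_KEY"},   # assumed auth header name
        language="en",
    )
    print(json.dumps({"speak": speak}, indent=2))
```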

Example Payload

```jsonc
{
  "speak": {
    "provider": {
      "type": "eleven_labs", // Optional if TTS provider is Deepgram
      "model": "open_ai_voice", // Use for OpenAI or Deepgram
      "model_id": "1234567890", // Optional if TTS provider is Deepgram. Use for ElevenLabs or Cartesia
      "voice": {
        "mode": "Cartesia mode type", // Optional if TTS provider is Deepgram. Use for Cartesia
        "id": "Cartesia voice id" // Optional if TTS provider is Deepgram. Use for Cartesia
      },
      "language": "en", // Optional if TTS provider is Deepgram. Use for Cartesia
      "language_code": "en-US" // Optional if TTS provider is Deepgram. Use for ElevenLabs
    },
    "endpoint": { // Optional if TTS provider is Deepgram
      "url": "https://api.example.com/tts",
      "headers": {
        "authorization": "Bearer {{token}}"
      }
    }
  }
}
```
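Because the example payload combines fields from several providers, it can help to check a draft `speak` object against the per-provider parameters before sending it. The Python sketch below is one way to do that. The helper `missing_fields` is hypothetical, and it assumes that every provider field not marked optional in the tables is required and that `endpoint` sits beside `provider` as in the payload above.

```python
# Assumption: provider fields not marked "Optional" in the tables are required.
REQUIRED_PROVIDER_KEYS = {
    "deepgram": ("model",),
    "open_ai": ("model",),
    "eleven_labs": ("model_id",),
    "cartesia": ("model_id",),
}


def missing_fields(speak: dict) -> list:
    """Return dotted paths of fields that appear to be missing from `speak`."""
    provider = speak.get("provider", {})
    # The example payload treats `type` as optional for Deepgram.
    ptype = provider.get("type", "deepgram")
    if ptype not in REQUIRED_PROVIDER_KEYS:
        raise ValueError(f"unknown provider type: {ptype!r}")

    missing = [f"provider.{key}" for key in REQUIRED_PROVIDER_KEYS[ptype]
               if key not in provider]
    if ptype == "cartesia":
        # Cartesia also needs the nested voice mode and voice ID.
        voice = provider.get("voice", {})
        missing += [f"provider.voice.{key}" for key in ("mode", "id")
                    if key not in voice]
    if ptype != "deepgram":
        # Third-party providers need an endpoint with both url and headers.
        endpoint = speak.get("endpoint", {})
        missing += [f"endpoint.{key}" for key in ("url", "headers")
                    if key not in endpoint]
    return missing


if __name__ == "__main__":
    draft = {"provider": {"type": "cartesia", "model_id": "sonic-2"}}
    print(missing_fields(draft))
```

An empty list means the draft has every field this sketch knows about; it does not validate the values themselves.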
