Multilingual Voice Agents
A multilingual voice agent has two model decisions: which STT model transcribes the user, and which TTS model speaks the agent. Pick each one based on what your agent needs to do at runtime.
Pick your STT model
Flux Multilingual is the default recommendation. It handles turn awareness and interruption with the same low latency as flux-general-en. Use Nova-3 only when you need code-switching but not the conversational features.
Flux Multilingual configuration
agent.listen.provider.type:deepgramagent.listen.provider.version:v2agent.listen.provider.model:flux-general-multiagent.listen.provider.language_hint: one or more BCP-47 codes (optional)
The language_hint parameter biases the model toward specific languages and improves accuracy. With no hints, the model auto-detects the spoken language. Pass one hint for known-language calls and multiple hints for multilingual support centers. See Flux Multilingual & Language Prompting for the full hint reference and supported languages.
When you use flux-general-multi, user ConversationText events include languages_hinted and languages fields. See Conversation Text.
Nova-3 multi configuration
agent.listen.provider.model:nova-3agent.listen.provider.language:multi
Pick your TTS model
Deepgram Aura codeswitching (English/Spanish)
Aura ships five voices that switch between English and Spanish naturally inside one response: Aquila, Carina, Diana, Javier, and Selena.
agent.speak.provider.type:deepgramagent.speak.provider.model:aura-2-aquila-es(oraura-2-carina-es,aura-2-diana-es,aura-2-javier-es,aura-2-selena-es)
These voices handle mixed-language responses without switching providers. See TTS Models for the full Spanish voice catalog.
Third-party multilingual TTS
For other language combinations, set the speak provider to OpenAI, Eleven Labs, or Cartesia and pass agent.speak.provider.language: "multi". For Eleven Labs, this parameter maps to language_code.
Prompt for the language behavior you want
LLM behavior varies by provider. The prompt steers the agent toward a specific language strategy.