Flux Multilingual & Language Prompting
Flux Multilingual & Language Prompting
Use Flux Multilingual with language hints to get near-monolingual accuracy in 10 languages, or let the model auto-detect.
Flux Multilingual (flux-general-multi) is a single model supporting 10 languages with the same turn-aware, interruption-aware conversational intelligence as flux-general-en. The optional language_hint parameter biases the model toward specific languages, delivering accuracy on par with dedicated monolingual models. Without hints, the model auto-detects the spoken language.
Flux Multilingual uses the same production endpoint and API key you already use for Flux. Just set model=flux-general-multi — no new credentials or endpoints required. Pricing is the same as flux-general-en.
An EU endpoint is also available: wss://api.eu.deepgram.com/v2/listen?model=flux-general-multi
SDK and self-hosted support are not yet available for flux-general-multi — use direct WebSocket connections. Both are coming soon.
Supported Languages
Locale-level subtags (e.g., en-GB, pt-BR) are accepted. If no exact match exists, Flux treats them as the base language code.
The language_hint Parameter
language_hint string (optional, repeatable)
Pass one or more language_hint values to bias the model toward specific languages. This improves accuracy when you know the expected language(s) ahead of time.
language_hint is only supported on flux-general-multi. Sending it to any other model (including flux-general-en) returns a 400 error.
Existing Flux concurrency limits now apply across both flux-general-en and flux-general-multi (shared pool). See API Rate Limits for details.
Usage Scenarios
1. Known Single Language
When you know the caller’s language ahead of time (e.g., a Spanish-language call center), set a single language_hint for best accuracy.
2. Known Subset of Languages
When callers may speak one of several languages (e.g., a bilingual English/Spanish support line), pass multiple hints. The model biases toward the specified set while still producing accurate transcripts regardless of which language is spoken.
3. Unknown Language
When you have no knowledge of what language the caller will speak, omit language_hint entirely. The model auto-detects the language from the audio.
4. Code-Switching
When speakers switch between languages mid-conversation (e.g., a bilingual speaker mixing English and Spanish), set hints for the expected languages. Flux handles mid-sentence language switches natively.
Language Detection in TurnInfo Events
When using flux-general-multi, all TurnInfo events include two additional fields:
Example TurnInfo Response
Use the languages field to route downstream processing — for example, selecting the correct TTS voice or LLM prompt language based on what the user actually spoke.
Mid-Stream Reconfiguration
You can update language hints during a stream using the Configure control message without disconnecting. This is useful when conversational context changes — for example, after detecting the caller’s language, you can narrow the hints for better accuracy.
Pattern: Detect-then-Lock
A common voice agent pattern is to start a call with broad language detection, then lock in the detected language for the rest of the conversation. This gives you the best of both worlds: flexible auto-detection at the start and high-accuracy single-language transcription once the caller’s language is known.
How it works:
- Connect with no hints (or a broad subset of expected languages) to let the model auto-detect.
- Read the
languagesfield from the firstEndOfTurnevent to identify the caller’s language. - Send a Configure message to lock in that language as a single hint for the remainder of the call.
- Monitor for language changes — if a subsequent turn returns a different primary language in
languages, send another Configure to update the hint.
Step 1 — Connect with broad detection:
Or, if you know callers will speak one of a few languages, start with a subset:
Step 2 — Read the detected language from the first EndOfTurn:
The first entry in languages is the primary language by word count.
Step 3 — Lock in the detected language:
This biases all subsequent transcription toward Spanish, improving accuracy for the rest of the call.
Step 4 — Handle language switches (optional):
If a later turn returns a different primary language, update the hint:
Locking in a single language after detection delivers the best accuracy — comparable to using a dedicated monolingual model. For calls where code-switching is expected throughout, keep multiple hints active instead of locking to one language.
Error Handling
Example error response:
Flux Multilingual vs. flux-general-en
flux-general-en remains available and recommended for English-only workloads. Use flux-general-multi when you need multilingual support or expect non-English audio. Both models share the same turn detection architecture, end-of-turn configuration, and control message interface.
Related Resources
- Getting Started with Flux — Quickstart guide
- End-of-Turn Configuration — Tune turn detection behavior
- Configure Control Message — Update settings mid-stream
- Multilingual Codeswitching — Codeswitching on Nova-2/Nova-3
- Models & Languages Overview — All models and supported languages