Voice Agent API

New Features

🎯 Smart Formatting for More Readable Conversations

We’ve added a new smart_format option to improve transcript readability in UI applications. This feature enables smart formatting for better user experience when displaying transcribed conversation.

Key Features:

  • Enhanced transcript formatting for UI applications
  • Defaults to false for backward compatibility

Implementation: Configure the smart_format option in your Voice Agent listen provider settings:

1{
2 "agent": {
3 "listen": {
4 "provider": {
5 "type": "deepgram",
6 "model": "nova-3",
7 "smart_format": true
8 }
9 }
10 }
11}

For complete implementation details, see our Voice Agent configuration documentation.

🔒 Model Improvement Program Opt-Out

Users can now opt out of our Model Improvement Program when using the Voice Agent API.

Implementation: Add mip_opt_out: true to your Settings message:

1{
2 "type": "Settings",
3 "mip_opt_out": true,
4 "agent": {
5 "listen": {
6 "provider": {
7 "type": "deepgram",
8 "model": "nova-3"
9 }
10 }
11 }
12}

For more information about the Model Improvement Program and opt-out options, visit our Model Improvement Partnership Program documentation.

🤖 Gemini LLM Support

We’ve added support for Google’s Gemini LLMs in our Voice Agent API! This expands our LLM options to include Google’s powerful language models.

Available Models:

  • Gemini 2.5 Flash
  • Gemini 2.0 Flash
  • Gemini 2.0 Flash Lite

For complete information about supported LLMs including Gemini models, visit our Voice Agent LLM Models documentation.