Genesys Cloud CX and Deepgram Voice Agent

The Deepgram Voice Agent API integrates with Genesys Cloud CX through the Genesys Audio Connector, establishing a bidirectional WebSocket connection for real-time voice input and output.

The Voice Agent API bundles speech-to-text, LLM reasoning, and text-to-speech into a single pipeline, delivering approximately 425 ms round-trip latency.

How it works

A caller enters your Genesys call flow
Genesys connects to your Voice Agent through a WebSocket
Audio streams to Deepgram (STT, LLM, TTS)
The agent responds with synthesized voice
The caller can interrupt naturally with barge-in support

Before you begin

You will need:

A Deepgram API key with Voice Agent enabled
A Genesys Cloud CX account with Audio Connector access
A public server endpoint connected to Deepgram

Before you can use Deepgram, you’ll need to create a Deepgram account. Signup is free and includes $200 in free credit and access to all of Deepgram’s features!

Step 1: Create an Audio Connector in Genesys

Go to Admin > Integrations
Add Audio Connector
Name the integration (for example, Deepgram Voice Agent)
Set Connection URI to wss://integrations.deepgram.com/genesys/ac
Add your Deepgram API key under Credentials
Activate the integration

Step 2: Add the connector to your Architect flow

Open Architect in Genesys Cloud
Open or create an Inbound Call Flow
Add a Call Audio Connector action
Select the integration you created in Step 1

This action hands the call to the Deepgram Voice Agent.

Step 3: Configure your AI agent

In the Audio Connector’s Session Variables, add a variable:

Name: config
Value: a JSON string containing your agent configuration

For optimal audio quality, include the following in your agent configuration:

1 "audio": {
2   "input": {
3     "encoding": "mulaw",
4     "sample_rate": 8000
5   },
6   "output": {
7     "encoding": "mulaw",
8     "sample_rate": 8000,
9     "container": "none"
10   }
11 }

To allow the Voice Agent to end the call, add an end_conversation function to the agent configuration. The function’s properties are sent back to Genesys and can be mapped as session output variables.

Example end conversation function

1 {
2   "name": "end_conversation",
3   "description": "Call this function when the user says goodbye or the conversation ends.",
4   "parameters": {
5     "type": "object",
6     "properties": {
7       "needs_escalation": {
8         "type": "boolean",
9         "description": "Set to true if the user requests a human or manager."
10       }
11     }
12   }
13 }

Step 4: Set up middleware server (self-hosted only)

If you use a self-hosted setup, deploy a middleware server to connect Genesys with the Deepgram Voice Agent. Contact your Deepgram account executive for access and setup instructions.

Customization options

You can adjust your agent’s behavior using these configuration fields:

Setting	Field
Personality	`think.prompt`
Voice	`speak.provider.model`
Greeting	`greeting`
Knowledge / abilities	`think.functions`