Build a Voice Agent
Learn how to build a real-time voice agent using Deepgram’s Agent API.
Learn how to build a real-time voice agent using Deepgram’s Agent API.
Deepgram’s Voice Agent API uses a single WebSocket connection to handle the entire conversational loop. The API integrates speech-to-text, a large language model (LLM), and text-to-speech into one stream.
Building a voice agent involves four main steps over a WebSocket:
Settings message to define the models, voices, and behavior.The Voice Agent API is available on the EU endpoint at wss://api.eu.deepgram.com/v1/agent/converse. See Regional Endpoints for details.
Select a language to start building your voice agent. Each tutorial provides a complete, end-to-end implementation.
Build a voice agent using the Deepgram Python SDK.
Build a voice agent using the Deepgram JavaScript SDK.
Build a voice agent using the Deepgram .NET SDK.
Build a voice agent using the Deepgram Go SDK.
Once you understand the basics, you can explore more advanced configurations:
Check out these repositories for more complex voice agent implementations:
For information on concurrency limits, refer to the API Rate Limits documentation.
Deepgram calculates usage based on WebSocket connection time. One hour of connection time equals one hour of API usage.