For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Ask AIPlaygroundLoginFree API Key
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
  • Get Started
    • Overview
    • Build a Voice Agent
    • Feature Overview
    • Template Apps
  • Configure
    • Overview
    • STT Models
    • LLM Models
    • TTS Models
    • Media Inputs & Outputs
    • Prompting Voice Agents
    • Multilingual Voice Agents
    • Maintaining Context
    • Reusable Agent Configurations
  • Build
    • Multi-Agent Architecture
  • Connect
  • Controls
  • Optimize
    • Voice Agent TTS Controls
    • Message Flow
    • Audio & Playback
    • Audio Preprocessing & Barge-In
    • Adaptive Echo Cancellation
  • Resources
    • SDKs
    • UI Components
    • API Reference
LogoLogo
Ask AIPlaygroundLoginFree API Key
On this page
  • Explore Further
Get Started

Getting Started

Build real-time, interactive voice agents powered by Deepgram’s speech-to-text, LLM integration, and text-to-speech, all over a single WebSocket connection.

Was this page helpful?

Build a Voice Agent

Learn how to build a real-time voice agent using Deepgram’s Agent API.

Next
Built with
Deepgram API Playground
Try this feature out in our API Playground.

Deepgram’s Voice Agent API handles the full speech pipeline (listening, thinking, and speaking) so you can focus on what your agent does, not how it hears or talks.

Build a Voice Agent

Step-by-step guide to creating your first voice agent with Python, JavaScript, C#, or Go using the server-side SDKs and WebSocket API.

Configure Your Agent

Set up speech-to-text models, LLM providers, TTS voices, endpointing, and audio formats for your voice agent.

Function Calling

Let your agent call external APIs and tools mid-conversation.

Multi-Agent Architecture

Build systems where multiple specialized agents hand off conversations based on context, intent, or domain expertise.

Explore Further

Reusable Configurations

Save and reuse agent settings across projects.

Telephony

Connect voice agents to phone networks for inbound and outbound calls.

Browser Agent SDK

Add voice AI to any web application. Four composable packages, from a single script tag to a fully custom React interface.

Prompting

Write effective system prompts that shape how your agent behaves on a live call.

Template Apps

Clone a working voice agent project and start building.

API Reference

Complete WebSocket protocol reference for the Agent API.

Feature Overview

Full list of Voice Agent API capabilities.