For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Ask AIPlaygroundLoginFree API Key
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
HomeAPI ReferenceVoice AgentSpeech-to-TextText-to-SpeechIntelligenceSelf-Hosted Deployments
  • Get Started
    • Overview
    • Build a Voice Agent
    • Feature Overview
    • Template Apps
  • Configure
    • Overview
    • STT Models
    • LLM Models
    • TTS Models
    • Media Inputs & Outputs
    • Prompting Voice Agents
    • Multilingual Voice Agents
    • Maintaining Context
    • Reusable Agent Configurations
  • Build
    • Multi-Agent Architecture
  • Connect
  • Controls
  • Optimize
    • Voice Agent TTS Controls
    • Message Flow
    • Audio & Playback
    • Audio Preprocessing & Barge-In
    • Adaptive Echo Cancellation
  • Resources
    • SDKs
    • UI Components
    • API Reference
LogoLogo
Ask AIPlaygroundLoginFree API Key
On this page
  • Purpose
  • Browser Implementation
  • Telephony Systems
  • What Echo Cancellation Addresses
  • When Echo Cancellation Doesn’t Apply
  • Noise Suppression, Barge-In & Broader Audio Preprocessing
Optimize

Voice Agent Adaptive Echo Cancellation

Learn some tips and strategies for dealing with echo cancellation.
Was this page helpful?
Previous
Built with

Adaptive echo cancellation reduces or eliminates audio feedback that occurs when a microphone picks up audio from speakers in the same environment, dynamically adjusting based on the audio conditions.

Purpose

Echo cancellation addresses audio feedback issues in real-time communication applications. Many modern browsers and telephony systems have built-in echo cancellation, so you may not need additional configuration depending on your Voice Agent implementation.

Browser Implementation

Chrome implements adaptive echo cancellation effectively through its built-in WebRTC capabilities for real-time communication. Most browsers and communication apps enable adaptive echo cancellation by default.

References for browser-based echo cancellation:

  • Native echo cancellation in Chrome
  • MediaTrackSettings: echoCancellation property
  • MediaTrackConstraints: echoCancellation property

Telephony Systems

Modern phones have hardware-level and software-level echo cancellation built in, ensuring users don’t hear their own voice echoed back during calls. When integrating with telephony services (PSTN or VoIP), the built-in echo cancellation handles audio issues automatically without requiring additional configuration.

Examples:

  • VoIP apps integrated with a phone’s native dialer benefit from built-in echo cancellation
  • No extra code or custom echo management is needed

What Echo Cancellation Addresses

Echo cancellation is most effective for:

  • Acoustic Echo: When microphones capture sound from nearby speakers during video calls
  • Audio Feedback: Preventing repeated amplification of sound in communication systems

Common scenarios where echo cancellation applies:

  • Video/audio calls with active speakers and microphones
  • Conferencing systems with multiple participants and devices
  • Real-time media apps like video chat or gaming applications

When Echo Cancellation Doesn’t Apply

If you’re experiencing playback issues unrelated to feedback loops (stuttering, latency, or low volume), the root cause is likely:

  • Poor internet connection in streaming applications
  • Hardware limitations (faulty speakers or headphones)
  • Incorrect audio configurations or codecs
  • Incorrect parameters in your Deepgram API request

In Voice Agent workflows, echo cancellation might not be the core issue if you’re experiencing other audio problems.

Noise Suppression, Barge-In & Broader Audio Preprocessing

Echo cancellation is one part of the audio preprocessing picture. For full recommendations on when to use noise suppression, echo cancellation, barge-in strategies, and other preprocessing techniques (including when they can hurt transcription accuracy), see Audio Preprocessing & Barge-In.