Media Inputs & Outputs
Use different media inputs and outputs when using the Voice Agent API.
Deepgram’s APIs provides robust support for both media input and output settings, enabling users to customize audio data processing and output generation to suit a variety of Voice Agent applications.
Speech to Text: Media Input Settings
Media input settings allow you to define the parameters for audio data submitted for processing. These settings help optimize the transcription process by specifying the characteristics of the audio data. Below is a summary of the available options for media input settings:
Text to Speech: Media Output Settings
Once the input audio is processed, Deepgram provides robust options for generating speech output tailored to your voice agent’s requirements. These settings enable customization of the synthesized audio or transcription results for downstream use.