Continuous Text Stream

Deepgram Text to Speech WebSocket

Handshake

GET

Headers

AuthorizationstringRequired

API key for authentication. Format should be be either ‘token <DEEPGRAM_API_KEY>’ or ‘Bearer <JWT_TOKEN>’

Query parameters

encodingenumOptionalDefaults to mp3

Encoding allows you to specify the expected encoding of your audio output

modelenumOptionalDefaults to aura-asteria-en

AI model used to process submitted text

sample_rateenumOptionalDefaults to 24000

Sample Rate specifies the sample rate for the output audio. Based on encoding 8000 or 24000 are possible defaults. For some encodings sample rate is not configurable.

Allowed values: 800016000240004410048000

Send

textToSpeechRequestobject
OR
speak_controlMessagesRequestobject

Receive

abc
textToSpeechResponsestring
OR
controlMessagesResponseobject
OR
speak_closeFrameobject
Built with