Agent Audio Done
The server sends AgentAudioDone after streaming the final audio chunk for an agent utterance.
AgentAudioDone arrives immediately after the last audio chunk for an agent utterance, so it marks the end of the server-side audio stream for that utterance. Use it to sequence follow-up actions that should wait until the agent has finished sending audio.
Client behavior
AgentAudioDone does not mean the user has heard the agent finish. Audio you have already received may still be waiting in your local playback buffer. To detect when playback has actually ended for the user, monitor your audio output queue rather than this event.
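One way to combine the two signals is sketched below. It is a minimal illustration, not part of the API: the `PlaybackTracker` class, its method names, and the callback are hypothetical, and it assumes the event arrives as a JSON text message whose `type` field is `"AgentAudioDone"`. It also assumes your audio output layer can report when each queued chunk has finished playing, and that only one utterance is in flight at a time.

```python
import json
from dataclasses import dataclass
from typing import Callable


@dataclass
class PlaybackTracker:
    """Hypothetical helper: fires a callback once the server has finished
    sending audio AND the local playback queue has drained."""

    on_playback_finished: Callable[[], None]
    queued_chunks: int = 0      # chunks received but not yet played
    server_done: bool = False   # True once AgentAudioDone has arrived

    def on_audio_chunk(self, chunk: bytes) -> None:
        # Call for every binary audio message received from the server.
        self.queued_chunks += 1

    def on_text_message(self, raw: str) -> None:
        # Call for every JSON text message received from the server.
        # Assumption: the event is identified by a "type" field.
        message = json.loads(raw)
        if message.get("type") == "AgentAudioDone":
            self.server_done = True
            self._maybe_finish()

    def on_chunk_played(self) -> None:
        # Call from your audio output layer when a chunk finishes playing.
        self.queued_chunks = max(0, self.queued_chunks - 1)
        self._maybe_finish()

    def _maybe_finish(self) -> None:
        # Playback is only truly over when both conditions hold.
        if self.server_done and self.queued_chunks == 0:
            self.server_done = False  # reset for the next utterance
            self.on_playback_finished()
```

With this arrangement, receiving AgentAudioDone while chunks are still queued does nothing by itself; the callback runs only after the last queued chunk has been played.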
Example payload
JSON