Filler Words
Filler Words can transcribe disfluencies in your audio, like "uh" and "um".
filler_words boolean. Default: false
Enable Feature
To enable Filler Words, use the following parameter in the query string when you call Deepgram’s /listen endpoint :
model=nova-2&filler_words=true
Currently, Filler Words are only available for Deepgram's Nova and Nova-2 general models.
Deepgram is capable of transcribing the following filler words:
- uh
- um
- mhmm
- mm-mm
- uh-uh
- uh-huh
- nuh-uh
These words will always be transcribed with the spelling listed above, regardless of their spoken duration (i.e., Deepgram will never transcribe "uhhhh" instead of "uh").
When filler_words=false or the parameter is not set, the two most common fillers, "uh" and "um", are stripped out of the transcript to improve readability.
curl \
--request POST \
--header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \
--header 'Content-Type: audio/wav' \
--data-binary @youraudio.wav \
--url 'https://api.deepgram.com/v1/listen?tier=nova&model=general&filler_words=true'
Replace
YOUR_DEEPGRAM_API_KEYwith your Deepgram API Key.
Results
Once applied, results will appear in the transcript.
| Truth | With Filler Words | Without Filler Words |
|---|---|---|
| uh-huh or you'd want something where uh so let's say you're trying to fine-tune a model to something very specific um so it's not as uh cut and dry as a more general task | uh-huh or you'd want something where uh so let's say you're trying to fine-tune a model to something very specific um so it's not as uh cut and dry as a more general task | uh-huh or you'd want something where so let's say you're trying to fine-tune a model to something very specific so it's not as cut and dry as a more general task |
Use Cases
Some examples of use cases for Filler Words include:
- Customers who need to collect analytics on the number of filler words spoken for coaching purposes.
- Customers who need to remove filler words from the audio based on where they appear in transcripts.
Updated about 13 hours ago