Topic Detection | Deepgram's Docs

Try this feature out in our API Playground.

topics boolean Default: false

Pre-recorded Streaming English (all available regions)

Deepgram’s Topic Detection feature identifies key topics within the transcript, returning a list of text segments and the topics found within each segment.

The list of topics that can be identified is not a fixed list; this TSLM powered feature is able to generate topics based on the context of the language content in the transcript. You may also choose to use the optional custom-topic parameter to provide a custom topic you want detected if present within your audio.

Enable Feature

To enable Topic Detection, use the following parameter in the query string when you call Deepgram’s /listen endpoint:

topics=true

To transcribe audio from a file on your computer, run the following curl command in a terminal or your favorite API client.

cURL

$ curl \
>   --request POST \
>   --header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \
>   --header 'Content-Type: audio/wav' \
>   --data-binary @youraudio.wav \
>   --url 'https://api.deepgram.com/v1/listen?topics=true'

Replace YOUR_DEEPGRAM_API_KEY with your Deepgram API Key.

Query Parameters

Parameter	Value	Type	Description
`topics`	`true`	boolean	Enables topic detection
`language`	`en`	string	The language of your input audio (Only English is supported at this time.)
`custom_topic`	ex: `animals`	string	Optional. A custom topic you want the model to detect within your input audio if present. Submit up to 100.
`custom_topic_mode`	`extended`, `strict`	string	Optional. Sets how the model will interpret strings submitted to the `custom_topic param`. When `strict`, the model will only return topics submitted using the `custom_topic` param. When `extended`, the model will return its own detected topics in addition to those submitted using the `custom_topic` param.

Analyze Response

When the file is finished processing, you’ll receive a JSON response that has the following basic structure:

JSON

1 {
2   "metadata": {...},
3   "results": {
4     "channels": [
5       {
6         "alternatives": [...]
7       }
8     ],
9     "topics": {
10       "segments": [
11         {
12           "text": "Can I upgrade my phone?",
13           "start_word": 13,
14           "end_word": 17,
15           "topics": [
16             { "topic": "Phone upgrade", "confidence_score": 0.9661531 }
17           ]
18         }
19       ]
20     }
21   }
22 }

The response object values for topics are:

segments: The list of segments of text identified by the model as containing notable topics.
topic: The name of the topic detected by the model.
confidence_score: a floating point from 0 to 1 representing the models confidence in this prediction.

API Warning Response

Warning

If you request Topic Detection with an unsupported language by specifying a language code such as topics=true&language=es or topics=true&detect_language=true where the detected language is unsupported, you will get the warning message below.

JSON

1 "warnings": [
2   {
3     "parameter": "topics",
4     "type": "unsupported_language",
5     "message": "Topics is only supported for English."
6   }
7 ]

Warning Name	Warning Message
`unsupported_language`	Feature isn’t supported with the specified or detected language.

Example Warning

Here is an example of the JSON structure of a request with warning object.

JSON

1 {
2  "metadata": {
3 ...
4        },
5      "warnings": [
6        {
7          "parameter": "topic",
8          "type": "unsupported_language",
9          "message": "Topics is only supported for English."
10        }
11      ]
12    },
13  "results": {
14        "channels": [
15             {
16                 "alternatives": [...]
17             }
18         ], 
19 
20     }
21 }

Use Cases

Some examples of uses for Topic Detection include:

Customers who want to help their Quality Assurance team analyze conversations to identify trends and patterns based on discussed topics.
Customers who need to extract meaningful and actionable insights from conversations and audio data based on discussed topics.
Customers who want to enhance search capabilities by tagging conversations based on identified topics.