Migrating from AssemblyAI Speech-to-Text to Deepgram

This guide walks developers through migrating from AssemblyAI’s speech-to-text (STT) service to Deepgram’s STT service using the Deepgram SDKs. It highlights the differences between the two platforms and shows the equivalent functionality on each, so the migration goes smoothly.

Getting Started

To use Deepgram, you’ll first need a Deepgram account. Sign-up is free and includes $200 in credit and access to all of Deepgram’s features.

Next, follow the steps in the Make Your First API Request guide to obtain a Deepgram API key and, if you plan to use a Deepgram SDK, to configure your environment.

Prerequisites

Before proceeding with the migration, ensure you meet the following prerequisites:

Required Tools

A code editor (e.g., Visual Studio Code)
Terminal or command prompt access
Node.js or Python installed

API Keys

AssemblyAI API Key: Obtain from your AssemblyAI dashboard.
Deepgram API Key: Sign up on Deepgram’s platform and get your API key in the Deepgram Console.

Overview of AssemblyAI and Deepgram APIs

Both AssemblyAI and Deepgram provide robust speech-to-text APIs, but they differ in endpoints, request parameters, and response structures. This guide maps AssemblyAI features to their Deepgram equivalents.

Step-by-Step Migration Instructions

1. Setting Up the Environment

Node: Ensure you have Node.js installed. If not, download and install it from the Node.js website.

Python: Ensure you have Python installed. If not, download and install it from the Python website.

2. Configuring API Keys

How to configure AssemblyAI API key

Create a .env file in your project directory and add your AssemblyAI API key:

.env

ASSEMBLYAI_API_KEY=your_assemblyai_api_key_here

How to configure Deepgram API key

Similarly, add your Deepgram API key to the .env file:

.env

DEEPGRAM_API_KEY=your_deepgram_api_key_here
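
A missing or misnamed key usually only shows up as an authentication error at request time, so it can help to fail fast at startup. The following is a minimal sketch, assuming the keys have been loaded into process.env (for example by dotenv.config()). The requireEnv helper is a hypothetical name used for illustration and is not part of either SDK.

```javascript
// Hypothetical helper: read a required environment variable or fail early.
// `env` defaults to process.env; passing it explicitly makes testing easier.
function requireEnv(name, env = process.env) {
  const value = env[name];
  if (typeof value !== "string" || value.trim() === "") {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  // Trim stray whitespace that sometimes ends up in .env files
  return value.trim();
}

// Usage sketch (after dotenv.config() has run):
// const deepgramKey = requireEnv("DEEPGRAM_API_KEY");
```

Calling the helper once for each key at the top of your script surfaces configuration problems before any audio is sent.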

3. Installing the SDK and Dependencies

For AssemblyAI:

$ npm install assemblyai dotenv

For Deepgram:

$ npm install @deepgram/sdk dotenv

4. Making API Requests

Initialization

AssemblyAI Initialization:

import { AssemblyAI } from "assemblyai";
import dotenv from "dotenv";

dotenv.config();

const client = new AssemblyAI({
  apiKey: process.env.ASSEMBLYAI_API_KEY,
});

Deepgram Initialization:

import { createClient } from "@deepgram/sdk";
import dotenv from "dotenv";

dotenv.config();

// createClient is the initializer used by the full examples in this guide
const deepgram = createClient(process.env.DEEPGRAM_API_KEY);

Add Request Parameters

AssemblyAI:

const data = {
  audio_url: "https://dpgr.am/spacewalk.wav", // the audio_url for the audio being transcribed is included
  speech_model: "nano",
  speaker_labels: true,
};

Deepgram:

const options = {
  model: "nova-3",
  smart_format: true,
  // Do not include the audio_url in this object
};
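
To make the difference concrete, here is a minimal sketch of a helper that splits an AssemblyAI-style request object into the two separate arguments used in the Deepgram samples in this guide: an audio source and an options object. It maps only the fields that appear in this guide (audio_url to url, speaker_labels to diarize). The name toDeepgramRequest and the unmapped bucket are hypothetical, and the Deepgram model is chosen explicitly because AssemblyAI speech_model values have no one-to-one Deepgram equivalent.

```javascript
// Hypothetical helper: convert AssemblyAI-style parameters into a Deepgram
// audio source plus options. Only fields shown in this guide are mapped;
// everything else is returned in `unmapped` for manual review.
function toDeepgramRequest(assemblyParams, model = "nova-3") {
  // speech_model is dropped on purpose: the Deepgram model is set explicitly
  const { audio_url, speech_model, speaker_labels, ...unmapped } = assemblyParams;

  const source = { url: audio_url }; // passed separately from the options
  const options = { model };
  if (speaker_labels !== undefined) {
    options.diarize = Boolean(speaker_labels);
  }
  return { source, options, unmapped };
}

const example = toDeepgramRequest({
  audio_url: "https://dpgr.am/spacewalk.wav",
  speech_model: "nano",
  speaker_labels: true,
});
console.log(example.source, example.options, example.unmapped);
```

Returning the unmapped fields rather than silently discarding them makes it easier to spot AssemblyAI parameters that still need a manual decision.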

Example: Transcribe Audio Using a Remote URL

Here is the entire code sample that shows how to transcribe audio using a remote URL.

AssemblyAI:

import { AssemblyAI } from "assemblyai";
import dotenv from "dotenv";
dotenv.config();

const client = new AssemblyAI({
  apiKey: process.env.ASSEMBLYAI_API_KEY,
});

const FILE_URL = "https://dpgr.am/spacewalk.wav";

const data = {
  audio_url: FILE_URL,
  speech_model: "nano",
  speaker_labels: true,
};

const run = async () => {
  const response = await client.transcripts.transcribe(data);
  console.log(JSON.stringify(response));
};

run();

Deepgram:

import { createClient } from "@deepgram/sdk";
import dotenv from "dotenv";
dotenv.config();

// The audio source is passed separately from the transcription options
const data = {
  url: "https://dpgr.am/spacewalk.wav",
};

const options = {
  model: "nova-3",
  diarize: true,
};

const run = async () => {
  const deepgram = createClient(process.env.DEEPGRAM_API_KEY);

  // The call resolves to an object holding either the result or an error
  const { result, error } = await deepgram.listen.prerecorded.transcribeUrl(
    data,
    options
  );
  if (error) {
    console.error(error);
    return;
  }
  console.dir(result, { depth: null });
};

run();

Example: Transcribe Audio Using a Local File

Here is the entire code sample that shows how to transcribe audio using a local file.

AssemblyAI:

import { AssemblyAI } from "assemblyai";
import dotenv from "dotenv";
dotenv.config();

const client = new AssemblyAI({
  apiKey: process.env.ASSEMBLYAI_API_KEY,
});

const AUDIO_FILE = "sample.wav";

const data = {
  audio: AUDIO_FILE,
  speech_model: "nano",
  speaker_labels: true,
};

const run = async () => {
  const response = await client.transcripts.transcribe(data);
  console.log(JSON.stringify(response));
};

run();

Deepgram:

import { createClient } from "@deepgram/sdk";
import fs from "fs";
import dotenv from "dotenv";
dotenv.config();

const deepgram = createClient(process.env.DEEPGRAM_API_KEY);

// Read the local audio file into a Buffer
const data = fs.readFileSync("sample.wav");

const options = {
  model: "nova-3",
  diarize: true,
};

const run = async () => {
  // The call resolves to an object holding either the result or an error
  const { result, error } = await deepgram.listen.prerecorded.transcribeFile(
    data,
    options
  );
  if (error) {
    console.error(error);
    return;
  }
  console.dir(result, { depth: null });
};

run();

5. Handling Responses

Compare the JSON responses:

AssemblyAI:

JSON

{
  "id": "some_id",
  "status": "completed",
  "audio_url": "https://dpgr.am/spacewalk.wav",
  "text": "Transcript text here...",
  "words": [
    {
      "start": 255,
      "end": 767,
      "text": "Yeah.",
      "confidence": 0.97465,
      "speaker": null
    }
  ]
}

Deepgram:

JSON

{
  "metadata": {
    "transaction_key": "deprecated",
    "request_id": "unique_request_id",
    "created": "2024-02-06T19:56:16.180Z",
    "duration": 25.933313,
    "channels": 1,
    "models": ["1abfe86b-e047-4eed-858a-35e5625b41ee"],
    "model_info": {}
  },
  "results": {
    "channels": [
      {
        "alternatives": [
          {
            "transcript": "Transcript text here...",
            "confidence": 0.99902344,
            "words": [
              {
                "word": "yeah",
                "start": 0.08,
                "end": 0.32,
                "confidence": 0.9975586,
                "punctuated_word": "Yeah."
              }
            ]
          }
        ]
      }
    ]
  }
}

6. Code Migration

Adapting your code to Deepgram’s response structure means reading nested fields in the JSON response body. For instance, if the body is held in a variable named response, response.results.channels[0].alternatives[0].transcript gives you the transcript text.
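
A direct property chain like this throws a TypeError if any level is missing, for example if the body is empty or shaped differently than expected. The following is a minimal sketch of a defensive accessor, assuming the response body has the shape shown in the sample JSON. The name getTranscript is hypothetical and is not an SDK function.

```javascript
// Hypothetical helper: return the transcript of the first channel's first
// alternative, or an empty string when any level of the structure is missing.
function getTranscript(response) {
  return response?.results?.channels?.[0]?.alternatives?.[0]?.transcript ?? "";
}

const sampleBody = {
  results: {
    channels: [{ alternatives: [{ transcript: "Transcript text here..." }] }],
  },
};
console.log(getTranscript(sampleBody));
console.log(getTranscript({}) === ""); // missing fields fall back to ""
```

Whether an empty string is the right fallback depends on your application; throwing a descriptive error may be preferable when a transcript is always expected.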

Be sure to update your data parsing logic to correctly navigate the nested response format, and thoroughly test the new code to ensure it handles various edge cases and accurately extracts the needed information.
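
One detail worth checking when you update the parsing logic: in the sample responses above, AssemblyAI word timings are integers in milliseconds, while Deepgram word timings are fractional seconds, and the word text lives in different fields. If downstream code expects the AssemblyAI word shape, a small adapter can bridge the gap during migration. This is a sketch under those assumptions; toAssemblyStyleWords is a hypothetical name, and the speaker value is passed through unchanged, so any code that depends on a specific speaker label format should be reviewed separately.

```javascript
// Hypothetical adapter: convert Deepgram-style word objects into the
// AssemblyAI-style shape shown in the sample responses.
function toAssemblyStyleWords(response) {
  const words =
    response?.results?.channels?.[0]?.alternatives?.[0]?.words ?? [];
  return words.map((w) => ({
    // Prefer the punctuated form when present, else the raw word
    text: w.punctuated_word ?? w.word,
    start: Math.round(w.start * 1000), // seconds -> milliseconds
    end: Math.round(w.end * 1000), // seconds -> milliseconds
    confidence: w.confidence,
    speaker: w.speaker ?? null, // assumed absent unless diarization is on
  }));
}

const deepgramBody = {
  results: {
    channels: [
      {
        alternatives: [
          {
            transcript: "Yeah.",
            words: [
              {
                word: "yeah",
                start: 0.08,
                end: 0.32,
                confidence: 0.9975586,
                punctuated_word: "Yeah.",
              },
            ],
          },
        ],
      },
    ],
  },
};
console.log(toAssemblyStyleWords(deepgramBody));
```

An adapter like this lets you migrate the transcription call first and update downstream consumers later, at the cost of carrying a compatibility layer.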
