Configure Deepgram on Modal

Manage Deepgram Modal deployments by label, edit TOML configs and models on Modal Volumes, and configure deployments for STT or TTS.

Configure Deepgram

Modal Deepgram deployments are managed by labels. Each label specifies a set of Deepgram TOML files and models. Keep in mind that some Deepgram models, such as Flux and Aura-2, must each be deployed on an independent Modal stack and cannot be co-hosted with any other models.

Set the label by exporting an environment variable.

$ export DEPLOY_LABEL=stt

The TOML files and models for a label are stored on Modal Volumes when you run:

$ modal run -m modal_deepgram.deepgram_resources \
>   --label $DEPLOY_LABEL \
>   --model-links-path <path-to-models-list-txt-file> \
>   --deploy-type license-proxy \
>   --source-api-config-file <api-toml-file> \
>   --source-engine-config-file <engine-toml-file>

This will:

- Download the Deepgram model weights to a Modal Volume
- Pull the appropriate config files from Deepgram’s repo
- Patch the config files to:
  - Use localhost with the desired ports
  - Specify the Modal Volume mount point for model weights

When calling modal run -m modal_deepgram.deepgram_resources, note that:

- The models .txt file path must be local.
- The config file names must match ones found in the Deepgram self-hosted resources repo.
- The --deploy-type argument takes either license-proxy or standard and selects the directory the configs are pulled from (default is license-proxy).
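The two conditions above that can be verified locally (a local links file and a valid deploy type) can be checked before a run. The following is a minimal sketch; check_resource_args is a hypothetical helper and not part of modal_deepgram.

```shell
# Hypothetical pre-flight check (not part of modal_deepgram).
# Verifies that the model links file is a readable local file and that
# the deploy type is one of the two documented values.
check_resource_args() {
  local links_path="$1"
  local deploy_type="${2:-license-proxy}"  # documented default

  if [ ! -f "$links_path" ] || [ ! -r "$links_path" ]; then
    echo "model links file not found locally: $links_path" >&2
    return 1
  fi

  case "$deploy_type" in
    license-proxy|standard) ;;
    *)
      echo "unknown deploy type: $deploy_type" >&2
      return 1
      ;;
  esac
}

# Example usage:
#   check_resource_args ./model-links.txt license-proxy && \
#     modal run -m modal_deepgram.deepgram_resources \
#       --label "$DEPLOY_LABEL" \
#       --model-links-path ./model-links.txt \
#       --deploy-type license-proxy
```

The check does not confirm that the config file names exist in the Deepgram repo; that still has to be verified by hand.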

Edit a Deepgram TOML config

Update config files for a deployment after the initial modal_deepgram.deepgram_resources run by pulling them from the Volume, editing, and uploading back to the Volume.

Pull the file locally:

$ modal volume get deepgram-cache configs/{label}/api.toml ./api.toml

Edit ./api.toml in your editor.

Push the change back:

$ modal volume put deepgram-cache ./api.toml configs/{label}/api.toml

To apply the change, redeploy with DEPLOY_LABEL={label} modal deploy -m modal_deepgram.app.
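The pull, edit, and push steps can be wrapped in a small script. This is a sketch under stated assumptions: config_volume_path and edit_deepgram_config are hypothetical helpers, the file name defaults to api.toml as in the commands above, and any other file name you pass is assumed to exist under configs/{label}/ on the Volume.

```shell
# Hypothetical helper (not part of modal_deepgram): builds the Volume path
# for a label's config file, following the configs/{label}/ layout.
config_volume_path() {
  local label="$1"
  local file="${2:-api.toml}"
  printf 'configs/%s/%s\n' "$label" "$file"
}

# Hypothetical helper: pull, edit, and push one config file for a label.
edit_deepgram_config() {
  local label="$1"
  local file="${2:-api.toml}"
  local remote
  remote="$(config_volume_path "$label" "$file")"

  modal volume get deepgram-cache "$remote" "./$file" || return 1
  cp "./$file" "./$file.bak"                 # keep the original for a diff
  ${EDITOR:-vi} "./$file" || return 1
  diff -u "./$file.bak" "./$file" || true    # show what changed
  # Assumption: depending on the Modal CLI version, overwriting an existing
  # remote file may require passing --force to `modal volume put`.
  modal volume put deepgram-cache "./$file" "$remote" || return 1
  echo "Redeploy with: DEPLOY_LABEL=$label modal deploy -m modal_deepgram.app"
}

# Example usage:
#   edit_deepgram_config stt api.toml
```

Keeping the .bak copy makes it easy to restore the previous config with another modal volume put if the redeployed service misbehaves.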

Update models

Passing --model-links-path when you run modal_deepgram.deepgram_resources wipes /models/{label}/ and downloads every URL in the file.

$ modal run -m modal_deepgram.deepgram_resources \
>   --label stt \
>   --model-links-path ./model-links.txt
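Because the run wipes the existing model directory before downloading, it may be worth sanity-checking the links file first. The sketch below assumes the file holds one http(s) URL per line with blank lines ignored; that format is an assumption, and check_model_links is a hypothetical helper rather than part of modal_deepgram.

```shell
# Hypothetical check (not part of modal_deepgram).
# Assumes one http(s) URL per line; blank lines are ignored.
# Prints every line that does not look like a URL and returns non-zero
# if the file is missing, empty, or contains such a line.
check_model_links() {
  local links_path="$1"
  local bad

  if [ ! -s "$links_path" ]; then
    echo "links file missing or empty: $links_path" >&2
    return 1
  fi

  bad="$(grep -v -E '^[[:space:]]*$' "$links_path" \
    | grep -v -E '^https?://[^[:space:]]+$' || true)"

  if [ -n "$bad" ]; then
    printf 'suspicious lines in %s:\n%s\n' "$links_path" "$bad" >&2
    return 1
  fi
}

# Example usage:
#   check_model_links ./model-links.txt && \
#     modal run -m modal_deepgram.deepgram_resources \
#       --label stt \
#       --model-links-path ./model-links.txt
```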

Example

The quickstart shows you how to deploy an STT service. To deploy Aura-2 TTS instead, follow these steps:

Save your Aura-2 model links to ./tts-model-links.txt.

Run modal_deepgram.deepgram_resources with the tts label and the Aura-2 config files. For language-specific deployments, swap the polyglot configs for variants like api.aura-2-en.toml / engine.aura-2-en.toml.

$ export DEPLOY_LABEL=tts
$ modal run -m modal_deepgram.deepgram_resources \
>   --label $DEPLOY_LABEL \
>   --model-links-path ./tts-model-links.txt \
>   --source-api-config-file api.aura-2-polyglot.toml \
>   --source-engine-config-file engine.aura-2-polyglot.toml

Update the hardware literals in modal_deepgram/app.py to TTS-recommended values — see Compute and Autoscaling → Configure hardware.

Deploy:

$ DEPLOY_LABEL=tts modal deploy -m modal_deepgram.app
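Since each label maps to its own stack, deploying several labels amounts to repeating the deploy command once per label. The wrapper below is a sketch only: deploy_labels and the DRY_RUN variable are hypothetical, each label's resources are assumed to be prepared already, and the hardware literals in modal_deepgram/app.py are assumed to suit every label in the list (which may not hold when mixing STT and TTS).

```shell
# Hypothetical wrapper (not part of modal_deepgram): deploys one stack per
# label. With DRY_RUN=1 it only prints the commands it would run.
deploy_labels() {
  local label
  for label in "$@"; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "DEPLOY_LABEL=$label modal deploy -m modal_deepgram.app"
    else
      DEPLOY_LABEL="$label" modal deploy -m modal_deepgram.app || return 1
    fi
  done
}

# Example usage:
#   DRY_RUN=1
#   deploy_labels stt tts
```

Running with DRY_RUN=1 first lets you confirm the label list before any stack is touched.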

Set Deepgram release version

To change the Deepgram release, edit DEEPGRAM_IMAGE_TAG in modal_deepgram/deepgram.py and redeploy the app. This will rebuild the container image.
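To confirm the current value before and after editing, the defining line can be printed with grep. This is a small sketch; show_image_tag is a hypothetical helper, and the exact form of the assignment inside the file is not assumed.

```shell
# Hypothetical helper (not part of modal_deepgram): prints the line(s)
# that mention DEEPGRAM_IMAGE_TAG, with line numbers. The default path
# is the file named in this section.
show_image_tag() {
  local file="${1:-modal_deepgram/deepgram.py}"
  grep -n 'DEEPGRAM_IMAGE_TAG' "$file"
}

# Example usage:
#   show_image_tag
#   ${EDITOR:-vi} modal_deepgram/deepgram.py
#   show_image_tag
#   DEPLOY_LABEL="$DEPLOY_LABEL" modal deploy -m modal_deepgram.app
```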
