Changelog


June 11, 2026

Deepgram Self-Hosted June 2026 Release (260611)

Container Images (release 260611)

  • quay.io/deepgram/self-hosted-api:release-260611
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.191.0-1
  • quay.io/deepgram/self-hosted-engine:release-260611
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-engine:3.118.0-1
    • Minimum required NVIDIA driver version: >=570.172.08
  • quay.io/deepgram/self-hosted-license-proxy:release-260611
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1-1
  • quay.io/deepgram/self-hosted-billing:release-260611
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.13.0

Action Required: Engine Container GPU Environment Variables

The Engine container change previewed in the May 28, 2026 release has shipped in this release. The Engine container now requires two environment variables to access the GPU:

  • NVIDIA_VISIBLE_DEVICES=all
  • NVIDIA_DRIVER_CAPABILITIES=compute,utility

If they are not set, the Engine container will fail to start after you upgrade to release-260611. Follow the step for your deployment method before pulling the release-260611 Engine image:

  • Official Helm chart (deepgram-self-hosted): Upgrade to chart version 0.37.0 or later. The chart sets both variables on the Engine pod automatically whenever a GPU is requested, so no manual change is needed. If you pin an older chart version, bump it as part of adopting this release.

  • Deepgram-provided Docker or Podman Compose files: Pull the latest files from deepgram/self-hosted-resources. They already set both variables on the Engine service.

  • Your own deployment manifests: Add both variables to the Engine container’s environment. For example, in a Docker or Podman Compose file:

    services:
      engine:
        image: quay.io/deepgram/self-hosted-engine:release-260611
        runtime: nvidia
        environment:
          NVIDIA_VISIBLE_DEVICES: "all"
          NVIDIA_DRIVER_CAPABILITIES: "compute,utility"

    Or, for a Kubernetes Engine container spec:

    env:
      - name: NVIDIA_VISIBLE_DEVICES
        value: "all"
      - name: NVIDIA_DRIVER_CAPABILITIES
        value: "compute,utility"

After setting the variables, upgrade your image tags to release-260611 and restart the Engine. Confirm it reaches a healthy state (for example, GET /v1/status returns 200) before routing production traffic.
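For hand-maintained manifests, a small pre-upgrade check can catch a missing variable before the Engine fails to start. The following is a minimal sketch, not a Deepgram tool: the function name `missing_engine_env` is hypothetical, and it assumes you have already loaded the Engine container's environment into a plain dictionary. It checks only that each variable is present and non-empty, not that it holds a particular value.

```python
# Variables the release-260611 Engine container needs for GPU access,
# as listed in this release note.
REQUIRED_ENGINE_ENV = ("NVIDIA_VISIBLE_DEVICES", "NVIDIA_DRIVER_CAPABILITIES")


def missing_engine_env(env):
    """Return the required variable names that are absent or empty in `env`.

    `env` is a mapping of variable name to value, for example the
    `environment` section of a Compose service parsed into a dict.
    """
    return [name for name in REQUIRED_ENGINE_ENV if not env.get(name)]


if __name__ == "__main__":
    # Hypothetical Engine environment with one variable forgotten.
    engine_env = {"NVIDIA_VISIBLE_DEVICES": "all"}
    print(missing_engine_env(engine_env))  # ['NVIDIA_DRIVER_CAPABILITIES']
```

An empty list means both variables are set and the image tag can be bumped.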

This Release Contains The Following Changes

  • Persian Profanity Filtering — profanity_filter=true now masks recognized profanity in Persian (fa) transcripts. See Profanity Filtering for the supported language list and usage.
  • English Redaction on Flux Streaming — redact now applies to English transcripts on the Flux streaming endpoint (/v2/listen).
  • Streaming Diarization Model Selection — a new diarize_model parameter selects the diarization model on streaming requests; accepted values are v1 and latest, and setting it enables diarization (no separate diarize=true required). See Diarization for details.
  • Number Formatting Improvements — number formatting now covers Simplified Mandarin (zh), Cantonese (zh-HK), and Bulgarian (bg). Across languages, ordinals written as numerals format more consistently, and indefinite articles (“a”/“an”) format as digits in quantity contexts.
  • Text-to-Speech Output Transcoding — /v1/speak now supports optional output transcoding to additional audio formats.
  • Aura-2 Numeric Pronunciation Fix — corrects a pronunciation issue on all-numeric inputs.
  • Voice Agent Third-Party Provider Reliability — improves reliability of ElevenLabs streaming and Cartesia cancellation handling for self-hosted Voice Agent.
  • General Improvements — Keeps our software up-to-date.

May 29, 2026

Nova-3 Medical Batch Model Upgrade

🆕 Improved Nova-3 Medical Batch Model Released

We’ve released an upgraded Nova-3 Medical batch model with improved medical term recognition.

Key Improvements:

  • Expands the model’s medical vocabulary with new medical terms
  • Boosts medical term recognition to 97.20% KRR
  • Maintains WER parity with no observed regression on common or conversational terms

Availability

  • Select the model by setting model=nova-3-medical in your batch transcription requests.

Learn more about our models and supported languages on the Models & Languages Overview page.

May 28, 2026

Deepgram Self-Hosted May 2026 Release (260528)

Container Images (release 260528)

  • quay.io/deepgram/self-hosted-api:release-260528
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.188.1
  • quay.io/deepgram/self-hosted-engine:release-260528
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-engine:3.117.0
    • Minimum required NVIDIA driver version: >=570.172.08
  • quay.io/deepgram/self-hosted-license-proxy:release-260528
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1
  • quay.io/deepgram/self-hosted-billing:release-260528
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.13.0

Preparation for a future Engine container change

The official Helm chart (0.37.0 and later) and the Docker and Podman compose files in deepgram/self-hosted-resources now set NVIDIA_VISIBLE_DEVICES=all and NVIDIA_DRIVER_CAPABILITIES=compute,utility on the Engine container. These env vars are no-ops with the release-260528 Engine image but are required for an upcoming Engine container refactor; deployments that adopt them now will not need a configuration change when that refactor ships. If you maintain your own deployment manifests, adding these env vars to the Engine container is safe to do at any time.

This Release Contains The Following Changes

  • Profanity Filtering for STT Multilingual — profanity_filter=true now masks recognized profanity in STT multilingual transcripts (language=multi). See Profanity Filtering for the supported language list and usage.
  • Improved Nova-3 Korean Word Spacing — Fixes an issue where Nova-3 Korean transcripts (ko, ko-KR) were sometimes missing spaces between words. Transcripts now better reflect proper Korean spacing.
  • General Improvements — Keeps our software up-to-date.

May 27, 2026

Gemini 3.5 Flash Now Available

gemini-3.5-flash is now available as a managed Google LLM in the Voice Agent API. This Standard tier model brings improved performance and efficiency to your voice agents.

Set the model in your agent configuration:

{
  "agent": {
    "think": {
      "provider": {
        "type": "google",
        "model": "gemini-3.5-flash",
        "temperature": 0.5
      }
    }
  }
}

Deprecation Notice

The Gemini 2.5 Flash family of models is scheduled for deprecation in October. Start testing newer models now to ensure a smooth migration.

For more details on Gemini model deprecations, see Google’s Gemini deprecations page.

For the full list of supported models and pricing tiers, see the Voice Agent LLM Models documentation.

May 21, 2026

Profanity Filtering Now Supported for All Multilingual Models; Korean Spacing Improvements

🆕 Profanity Filtering for Multilingual Models

Deepgram’s Profanity Filtering feature is now available for all multilingual models: Nova-2 multilingual, Nova-3 multilingual, and Flux multilingual (language=multi). You can enable profanity filtering in your API requests by setting the profanity_filter=true parameter. When enabled, inappropriate language is automatically replaced with asterisks (****) in the transcript.

This extends profanity filtering beyond single-language models, making it easier to process and moderate content in multilingual scenarios.

Learn more about using Profanity Filtering and see the full list of supported languages on the Profanity Filtering documentation page.

🛠️ Fix: Improved Word Spacing in Korean Transcripts

We fixed an issue affecting Korean transcripts (ko, ko-KR) where word spacing was sometimes missing. Transcripts should now better reflect proper Korean spacing, improving readability for users working with Korean audio.

See the full list of supported languages on the Models & Languages Overview page.

May 19, 2026

Gemini 3.1 Flash Lite Now Available

gemini-3.1-flash-lite is now available as a managed Google LLM in the Voice Agent API. This Standard tier model replaces the preview version.

Set the model in your agent configuration:

{
  "agent": {
    "think": {
      "provider": {
        "type": "google",
        "model": "gemini-3.1-flash-lite",
        "temperature": 0.5
      }
    }
  }
}

Deprecations

  • gemini-3.1-flash-lite-preview is deprecated and will be removed on May 26, 2026. Migrate to gemini-3.1-flash-lite.
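If agent settings are generated in code, the migration can be applied programmatically. This is a minimal sketch under stated assumptions: the helper name `migrate_think_model` is hypothetical, and it assumes the settings follow the agent.think.provider layout shown in the configuration above. The only replacement it knows is the one named in this deprecation notice.

```python
import copy

# Deprecated model -> replacement, taken from the deprecation notice above.
MODEL_REPLACEMENTS = {"gemini-3.1-flash-lite-preview": "gemini-3.1-flash-lite"}


def migrate_think_model(settings):
    """Return a copy of `settings` with a deprecated think model swapped out.

    The input dict is left untouched; settings without a deprecated model
    are returned as an unchanged copy.
    """
    updated = copy.deepcopy(settings)
    provider = updated.get("agent", {}).get("think", {}).get("provider", {})
    model = provider.get("model")
    if model in MODEL_REPLACEMENTS:
        provider["model"] = MODEL_REPLACEMENTS[model]
    return updated


if __name__ == "__main__":
    old = {
        "agent": {
            "think": {
                "provider": {
                    "type": "google",
                    "model": "gemini-3.1-flash-lite-preview",
                    "temperature": 0.5,
                }
            }
        }
    }
    new = migrate_think_model(old)
    print(new["agent"]["think"]["provider"]["model"])  # gemini-3.1-flash-lite
```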

For more details on Gemini model deprecations, see Google’s Gemini deprecations page.

For the full list of supported models and pricing tiers, see the Voice Agent LLM Models documentation.

May 15, 2026

Numerals Support Now Available for 3 New Languages: Russian, Romanian, and Hebrew (Monolingual Models)

Supported languages and language codes:

  • Russian (ru)
  • Romanian (ro)
  • Hebrew (he)

You can now use Deepgram’s Numerals feature with monolingual models for Russian, Romanian, and Hebrew. Numerals converts spoken numbers into digits (for example, “three hundred” → “300”) in your transcript, helping you create more accurate and easily processed results.

How to use Numerals:
To enable numerals, add the numerals=true parameter to your Deepgram API request.

Learn more about using Numerals and see the full list of supported languages on the Numerals documentation page.
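As an illustration, the request URL for a Numerals-enabled transcription could be assembled as below. This is a sketch, not an official client: the helper name `numerals_url` is hypothetical, and it assumes the hosted https://api.deepgram.com/v1/listen endpoint; a self-hosted deployment would use its own base URL.

```python
from urllib.parse import urlencode

# Assumption: the hosted speech-to-text endpoint.
BASE_URL = "https://api.deepgram.com/v1/listen"


def numerals_url(language):
    """Build a /v1/listen URL that enables Numerals for one language code."""
    query = urlencode({"language": language, "numerals": "true"})
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    # The three language codes added in this entry.
    for code in ("ru", "ro", "he"):
        print(numerals_url(code))
```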

May 14, 2026

Deepgram Self-Hosted May 2026 Release (260514)

Container Images (release 260514)

  • quay.io/deepgram/self-hosted-api:release-260514
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-api:1.187.0
  • quay.io/deepgram/self-hosted-engine:release-260514
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-engine:3.117.0
    • Minimum required NVIDIA driver version: >=570.172.08
  • quay.io/deepgram/self-hosted-license-proxy:release-260514
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-license-proxy:1.10.1
  • quay.io/deepgram/self-hosted-billing:release-260514
    • Equivalent image to:
      • quay.io/deepgram/self-hosted-billing:1.13.0

Batch Diarization v2 model delivery for new self-hosted deployments

Release 260514 ships Deepgram’s new batch diarization model (v2) to self-hosted. New deployments provisioned through your Deepgram representative will receive only the v2 batch diarizer model on disk by default. To produce diarized output on a fresh deployment, batch requests must specify diarize_model=v2 or diarize_model=latest. diarize=true on its own is pinned to v1; on a 260514 deployment that does not have the v1 model on disk, /v1/listen?diarize=true returns a successful response with no speaker labels — consistent with Deepgram’s longstanding behavior when a requested diarizer model is not present.

Existing deployments retain their v1 batch diarizer and continue to work without changes. To add v2 to an existing deployment, contact your Deepgram representative.
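Because a missing diarizer model yields a successful response without speaker labels, it may be worth checking responses explicitly on a fresh deployment. The sketch below is illustrative only: the function name `has_speaker_labels` is hypothetical, and it assumes the parsed JSON response nests words under results.channels[].alternatives[].words[], with a `speaker` field on each diarized word. Verify that layout against your own responses before relying on it.

```python
def has_speaker_labels(response):
    """Return True if any word in a parsed transcription response has a speaker label.

    Assumes the layout results -> channels -> alternatives -> words, with a
    "speaker" key on diarized words (an assumption, not confirmed by this page).
    """
    channels = response.get("results", {}).get("channels", [])
    for channel in channels:
        for alternative in channel.get("alternatives", []):
            if any("speaker" in word for word in alternative.get("words", [])):
                return True
    return False


if __name__ == "__main__":
    # Hypothetical response fragment with no speaker labels.
    undiarized = {
        "results": {
            "channels": [{"alternatives": [{"words": [{"word": "hello"}]}]}]
        }
    }
    if not has_speaker_labels(undiarized):
        print("No speaker labels: try diarize_model=v2 or diarize_model=latest")
```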

This Release Contains The Following Changes

  • Batch Diarization v2 — A new batch diarization model with significantly improved speaker labeling, preferred 3.3× over v1 in side-by-side human evaluation. Strongest gains on contact-center audio (~80% reduction in median Confusion Error Rate vs. v1, ~60% at p95). Compatible with Nova-1, Nova-2, Nova-3, plus enhanced and base batch models; monolingual and multilingual. Not compatible with Whisper. The API response format is unchanged from v1. Batch-only; streaming diarization is unchanged. See Speaker Diarization for details.
  • New diarize_model Parameter — Opt into v2 by passing diarize_model=v2 (pin to v2) or diarize_model=latest (recommended; auto-upgrades to future diarizer iterations) on pre-recorded /v1/listen requests. Unrecognized values return 400 Bad Request. Streaming requests reject diarize_model and return 400; use diarize=true for streaming diarization. diarize=true on batch continues to route to v1 to preserve behavior for existing integrations.
  • General Improvements — Keeps our software up-to-date.

Profanity Filtering Now Available in 50+ Languages

We’re excited to announce profanity filtering support for monolingual models in over 50 languages. Deepgram’s profanity filter automatically detects and masks offensive language in transcripts, helping you produce cleaner and safer content across a wide range of languages.

How to Use Profanity Filtering

To enable profanity filtering, add the profanity_filter=true parameter to your Deepgram API request.
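A request with the parameter set might be built as follows. This is a hedged sketch rather than official client code: the helper name `build_profanity_filter_request` is hypothetical, the audio URL and language code are placeholders, and the `Authorization: Token` header and JSON `url` body reflect the usual pre-recorded request shape, which you should confirm against the API reference. The request is constructed but not sent.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_profanity_filter_request(api_key, audio_url, language):
    """Build (but do not send) a POST request with profanity_filter=true."""
    query = urlencode({"language": language, "profanity_filter": "true"})
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return Request(
        f"https://api.deepgram.com/v1/listen?{query}",
        data=body,
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder key, audio URL, and language code.
    req = build_profanity_filter_request(
        "YOUR_API_KEY", "https://example.com/audio.wav", "es"
    )
    print(req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it.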

For more details, supported languages, and additional options, visit the Profanity Filter page.

May 13, 2026

Diarization v2: Improved Batch Speaker Diarization

A new batch diarization model is available today via the diarize_model API parameter.

Deepgram is rolling out v2 of our batch speaker diarization model, a new architecture offered on an opt-in basis through the diarize_model parameter. In side-by-side human evaluation, v2 was preferred 3.3× over our current production diarizer (v1). The largest gains were on contact-center audio, where median Confusion Error Rate (CER) fell roughly 80% compared with v1. Customers using diarize=true are unaffected.

Key Features:

  • New diarize_model parameter — A single parameter that both enables diarization and selects the version. Most customers should choose latest; v2 or v1 are also accepted.
  • diarize_model=latest auto-upgrades — Resolves to the newest GA diarizer. Today that’s v2.
  • No breaking changes — diarize=true continues to route to v1.
  • Compatible with the rest of the platform — Works with Nova-1, Nova-2, Nova-3, enhanced, and base batch models (async and sync), monolingual and multilingual, alongside existing batch features.

New diarize_model parameter:

The new diarize_model parameter enables diarization and selects the model version in a single parameter — no need to also set diarize=true:

https://api.deepgram.com/v1/listen?model=nova-3&diarize_model=latest

| Value | Description |
| --- | --- |
| latest | Resolves to the newest GA diarizer |
| v2 | New improved batch diarizer |
| v1 | Original production diarizer |

Migration guidance:

  • New integrations: For new projects we recommend diarize_model=latest. To pin a specific version, use diarize_model=v2 or diarize_model=v1.
  • Existing diarize=true users: No breaking changes — your existing requests continue to work with v1. To pick up v2’s improvements, update your requests to diarize_model=latest (always newest) or diarize_model=v2. We recommend testing on a representative sample of your audio before flipping production traffic.
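For integrations that assemble query parameters in code, the switch from diarize=true to diarize_model can be sketched as a small transformation. The helper name `migrate_diarize_params` is hypothetical. It applies only to batch (pre-recorded) requests, since this entry states that streaming requests reject diarize_model.

```python
from urllib.parse import urlencode

# Values accepted by diarize_model on batch requests, per the table above.
DIARIZE_MODELS = {"latest", "v2", "v1"}


def migrate_diarize_params(params, target="latest"):
    """Return a copy of batch query params with diarize=true replaced by diarize_model.

    Params that already set diarize_model, or that do not enable diarization,
    are returned unchanged.
    """
    if target not in DIARIZE_MODELS:
        raise ValueError(f"unsupported diarize_model: {target!r}")
    updated = dict(params)
    if "diarize_model" in updated:
        return updated
    if str(updated.get("diarize", "")).lower() == "true":
        del updated["diarize"]
        updated["diarize_model"] = target
    return updated


if __name__ == "__main__":
    old = {"model": "nova-3", "diarize": "true"}
    print(urlencode(migrate_diarize_params(old)))  # model=nova-3&diarize_model=latest
```

Pinning with `target="v2"` instead of the default trades automatic upgrades for stable behavior, which may suit teams that want to test each diarizer version before adopting it.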

No pricing changes. Diarization continues to be included at current rates.

Availability

  • Available now on the /v1/listen endpoint, on both US-hosted and EU-hosted endpoints
  • Supported on Nova-1, Nova-2, Nova-3, enhanced, and base batch models (async and sync), monolingual and multilingual
  • Streaming: diarize_model is not accepted on streaming requests and returns 400. Use diarize=true for streaming diarization. Streaming improvements ship separately.
  • Self-hosted support is now available. See Batch Diarization v2 Model Delivery for Self-Hosted Deployments for details.

Learn more in the Speaker Diarization documentation.

Nova-3 Portuguese Model Update

Improved Nova-3 Portuguese Model

We’ve enhanced the Nova-3 Portuguese model with improved transcription accuracy across Portuguese language variants, including Brazilian Portuguese (pt-BR) and European Portuguese (pt-PT).

To use the updated model, set model="nova-3" and use one of the supported Portuguese language codes:

  • language="pt"
  • language="pt-BR"
  • language="pt-PT"

Learn more about Nova-3 and supported languages on the Models & Languages Overview page.