Nova-3 Medical Model

Deepgram is proud to announce the release of Nova-3-Medical, our most advanced speech-to-text model designed specifically for clinical environments. Built on the foundation of Nova-3, this specialized model delivers superior transcription performance for healthcare applications.

Performance Improvements

  • 63.7% reduction in word error rate (WER) compared to next-best competitor (3.44% median WER)
  • 40.35% reduction in Keyword Error Rate (KER) compared to next-best competitor (6.79% KER)
  • 10.6% improvement in Keyword Recall Rate (KRR) over Nova-2-Medical (93.99% KRR)
  • Maintains industry-leading inference speed with ultra-low latency for real-time healthcare applications

New Features

  • Self-serve customization through Keyterm Prompting

    • Instantly adapt up to 100 domain-specific terms without model retraining
  • Enhanced capabilities for clinical environments

    • Improved handling of background noise typical in healthcare settings
    • Superior recognition of medication names, diagnostic terms, and procedure details
    • Accurate transcription even with far-field devices or ambient noise interference
    • Maintains exceptional accuracy in bustling clinics or hospitals with active medical equipment

Availability

Nova-3 English is now available through our API. To access:

  • Use model=nova-3-medical in your API calls
  • Available for hosted use
  • Supports both pre-recorded and real-time streaming transcription
  • Self-hosted deployments will be available in subsequent releases

For detailed information about Nova-3 Medical, please refer to our Developer Documentation.