Profanity Filtering Gets Expanded Language Support

Profanity filtering now supports 6 additional languages beyond English, giving you content moderation capabilities across your global user base. Available on monolingual models for:

Newly Supported:

  • German (de)
  • Swiss German (de-CH)
  • Polish (pl)
  • Portuguese (pt, pt-BR, pt-PT)
  • Spanish (es, es-419)
  • Swedish (sv, sv-SE)

Existing English Support:

  • en, en-US, en-AU, en-GB, en-NZ, en-IN

This expansion lets you deploy consistent content policies across international markets without building custom filtering logic.

Smart Formatting Improvements

We’ve resolved several high-impact formatting edge cases that were causing transcription accuracy issues in production environments:

Improved Entity Formatting via Smart Format

Email Transcription Improvements

  • Fixed: 'o' characters in email addresses now transcribe correctly instead of converting to '0'
  • Fixed: edge case email mentions that were being dropped entirely in specific batch processing scenarios

Certain formerly numeric-only sequences have been updated to correctly preserve all alphanumeric characters:

  • Before (some entities): "my account number is a b c d zero nine""my account number is 09"
  • After (some entities): "my account number is a b c d zero nine""my account number is ABCD09"

Quantity modifiers (‘single’, ‘double’, ‘triple’ + standalone character or number) are better handled via Smart Format:

  • Before (some entities): "double 2""2"
  • After (some entities): "double 2""22"

Special cases of ‘hundred’ or ‘a hundred’ now supported via Smart Format:

  • Before (some entities): "hundred percent""%"
  • After (some entities): "hundred percent""100%"

This update has gone out to all hosted streaming transcription, and will be applied to our next self-hosted release later this month.