Profanity Filtering Expanded Language Support
Profanity Filtering Gets Expanded Language Support
Profanity filtering now supports 6 additional languages beyond English, giving you content moderation capabilities across your global user base. Available on monolingual models for:
Newly Supported:
- German (
de) - Swiss German (
de-CH) - Polish (
pl) - Portuguese (
pt,pt-BR,pt-PT) - Spanish (
es,es-419) - Swedish (
sv,sv-SE)
Existing English Support:
en,en-US,en-AU,en-GB,en-NZ,en-IN
This expansion lets you deploy consistent content policies across international markets without building custom filtering logic.
Smart Formatting Improvements
We’ve resolved several high-impact formatting edge cases that were causing transcription accuracy issues in production environments:
Improved Entity Formatting via Smart Format
Email Transcription Improvements
- Fixed:
'o'characters in email addresses now transcribe correctly instead of converting to'0' - Fixed: edge case email mentions that were being dropped entirely in specific batch processing scenarios
Certain formerly numeric-only sequences have been updated to correctly preserve all alphanumeric characters:
- Before (some entities):
"my account number is a b c d zero nine"→"my account number is 09" - After (some entities):
"my account number is a b c d zero nine"→"my account number is ABCD09"
Quantity modifiers (‘single’, ‘double’, ‘triple’ + standalone character or number) are better handled via Smart Format:
- Before (some entities):
"double 2"→"2" - After (some entities):
"double 2"→"22"
Special cases of ‘hundred’ or ‘a hundred’ now supported via Smart Format:
- Before (some entities):
"hundred percent"→"%" - After (some entities):
"hundred percent"→"100%"
This update has gone out to all hosted streaming transcription, and will be applied to our next self-hosted release later this month.