Tag: WER
-

Real-World Speech-to-Text Accuracy: Benchmarking AssemblyAI, Deepgram, WhisperX & Saaras on Production Audio
Every time a new AI speech-to-text model launches, we see impressive benchmark numbers. Gemini, GPT-4o, Whisper, Nova — and recently Saaras — all report strong results on datasets like LibriSpeech and Common Voice. But there’s a problem. Those benchmarks don’t represent real production audio. At Scribie, we run a professional transcription service. The files we…
-
Improved Automated Transcripts
Our latest speech and language models have been released. There are several new features in this release. The following is a list: Acoustic Model: This is our fourth acoustic model trained on our data. The dataset contained mostly accented speakers (eg. Indian, African, Irish etc.). It also contained some noisy files. The accuracy of the…