OpenAI Whisper Accuracy (Tflite, Whisper.CPP and Large-V2)

December 11, 2022

Whisper popularity wave continues. Many projects appear for whisper-based web services, whisper on mobile and so on. Some projects modify Whisper models and algorithms to improve speed and it raises...

Kaldi Gigaspeech Vosk Model Release

November 13, 2022

Recently Kaldi project released a pack of models trained on Gigaspeech. You can find them [here](http://kaldi-asr.org/models/m14) Models are good, not significantly better than our previous model, but not significantly worse...

OpenAI Whisper Accuracy and other recent models (Nemo Transducer XLarge, Gigaspeech)

October 22, 2022

Everyone is crazy about OpenAI Whisper. Trained on 680 thousand hours of multilingual data it indeed sets a new stage in speech recognition. We tested it and some other recent...

Voting, Ensembles and bringing AI to life

June 14, 2022

While people recently argue if Google's model [is sentient](https://news.ycombinator.com/item?id=31721584) we must admit that another important property of the living creatures emerged in recent AI models - they started to have...

Why Chinese WER is important

May 29, 2022

Almost a year we haven't updated the news here. Time goes fast and new things keep us busy. There are some news to discuss, but they are mostly worth a...

Multistream TDNN and new Vosk model

July 16, 2021

What I really like in speech recognition and what keeps me excited about it is an active on-going development of speech recognition technology which boosts both speech recognition results and,...

Active learning in speech recognition

July 13, 2021

While dataset sizes grow beyond 10 thousand hours (Gigaspeech) the compute requirements for speech recognition research also grow. Any research even a simple architecture testing gets harder and harder because...

NVIDIA Nemo Conformer-CTC model test results

June 15, 2021

Not long after Citrinet Nvidia NeMo released Conformer-CTC model. As usual, forget about Citrinet now, Conformer-CTC is way better. The model is available for download [here](https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_en_conformer_ctc_large), latest Nemo repo supports...

ICASSP 2021 Part 1

June 05, 2021

This week ICASSP 2021 starts online. A bit late time for a year and everyone already looked on the publications. Many papers are already on Arxiv for some time, some...

NVIDIA Nemo Citrinet model test results

April 23, 2021

The race for biggest model continue. Recently NVIDIA came out with a Citrinet model, a bigger and more advanced version of Quartznet. The publication is: [Citrinet: Closing the Gap between...