Recently I came around a nice demo about generation of natural sounds from physical models. This is really an exciting topic because while Hollywood can now draw almost everything like...
Recently a great collection of videos from Speech and Audio in the Northeast (SANE) 2015 workshop has been shared. The main topic of the workshop was sound signal separation which I consider...
I've recently met an interesting paper worth considerationRethinking Algorithm Design and Development in Speech Processingby Thilo Stadelmann et alThis is not mainstream research, but it is exactly what makes it...
If I would be a scientific advisor I would give my student the following problem:Take a text, take an LM, computer perplexity:file test.txt: 107247 sentences, 1.7608e+06 words, 21302 OOVs 0...
There is one thing I usually wonder about while reading the next conference paper on speech recognition. The usual paper limit is 4 pages and the authors usually want to...
LDC has recently announced availability of a very large speech database for acoustic model training. A database named Mixer 6 contains incredible amount of 15000 hours of transcribed speech data by...
Last week I've been working on PNCC features which are famous features for speech recognition by Chanwoo Kim and Richard Stern. I made quite some experiments with parameters and research around PNCC. Here...
I spent some time recently building a language model from the open Gutenberg texts, it has been released today:http://cmusphinx.sourceforge.net/2013/01/a-new-english-language-model-release/Unfortunately, it appeared that it's very hard to build a model which...
The cool project in CMUSphinx ecosystem is UniMRCP. It implements MRCP protocol and backends which allow to use speech functions from the most common telephony frameworks like Asterisk or Freeswitch.Beside...
There are quite many solutions around to build an open source speech recognition system for low-resource device and it's quite hard to choose. For example you need a speech recognition...