Testing ASR with Voxforge Database

April 18, 2010

In development and research the critical issue is proper testing. There was some buzz about that recently, for example at MLoss blog where pros for using open data are considered....

Finally Sorted Out Workshop Materials

March 25, 2010

Since they are more CMUSphinx official documents, I posted notes about workshop and meetins after that on the website:Sphinx Users And Developers Workshop 2010 ResultsDevelopment Meeting NotesI'm pleased to get...

ICASSP 2010

March 22, 2010

So, I'm back from ICASSP in Dallas, TX. It was very impressive conference with lots of interesting and inspiring presentations, meetings and discussions. Amazing everyone was there and I've finally...

Sphinx4 1.0 beta4 Is Released. What's next?

March 01, 2010

So, almost according to schedule, sphinx4 was released yesterday. Check the notes athttp://cmusphinx.sourceforge.net/2010/03/sphinx4-1-0-beta-4-released/Most notable improvements were already discussed here, so let me try to plan what the next release will...

Speech Recognition in GSoC Done Right

February 27, 2010

From year to year many end-user projecs are trying to push ASR with the help of Google and studens of the Summer Of Code program. If CMUSphinx team knows all...

Noise reduction filtering in sphinx4

February 12, 2010

There is a huge gap between stock sphinx4 and real ASR system since critical parts like noise filtering, speaker diarization and postprocessing are missing. Not to mention the online adaptation....

All ideas are already generated

January 31, 2010

After seeing flash websites take enormous amount of my CPU got a cool idea today about using flash for distributed computing. Basically everything is already in place. You setup webserver,...

Training process

January 31, 2010

What I really like in Sphinxtrain is that it provides straightforward way for training an audio model. It remains unclear for me why everyone bothers with HTKBook while there is...

Moving Beyond the `Beads-On-A-String'

January 21, 2010

Recently I've got interested in quite a large domain of speech recognition research where old school linguistic meets modern speech recognition. Basically the idea is that in spontaneous speech variativity...

Three Generation of IVR Systems

January 16, 2010

Recently I invented new nice concept for marketing people. Basicallly there are three generations of IVR systems right now:Generation 1.0 - Static systems based on VoiceXML. It was suprising for...