When Language Models Fail

July 26, 2011

Language modeling still has many challenging problems. Comic by Jim Benton

Decoders And Features

July 23, 2011

CMUSphinx decoders at a glance, so one can compare them. The table is incomplete and imprecise, of course.

                     sphinx2  sphinx3  sphinx4  pocketsphinx
Acoustic Lookahead      -        -        +          -
Alignment               +        +        +...

Cars Controlled By Speech

June 28, 2011

Being a speech recognition guy, I'm looking for a car with speech recognition included. It sounds strange to select a car just because of that, but I'm only kidding. So far...

ICASSP 2011 Part 1 - Thoughts

June 21, 2011

It seems ICASSP this year was a great event; it is a pity I missed it. Just comparing the keynote lists, ICASSP beats Interspeech 4:0. ICASSP is very technical, Interspeech is...

Chicken-And-Egg in Sphinxbase

May 02, 2011

Recently Shea Levy pointed me to an issue with verbose output during pocketsphinx initialization. Basically, every time you start pocketsphinx, you get something like

INFO: cmd_ln.c(691): Parsing command line:
pocketsphinx_continuous
Current configuration:
[NAME]...

Voicemail transcription with Pocketsphinx and Asterisk (Part 2)

April 26, 2011

This is the second part, which describes voicemail transcription for Asterisk administrators. See the previous part, which describes how to set up Pocketsphinx, here. So you have configured the recognizer to transcribe voicemails...

CMUSphinx accepted at Google Summer Of Code 2011

March 19, 2011

So we are in. Great to know that. For more information see http://cmusphinx.sourceforge.net/2011/03/cmusphinx-at-gsoc-2011/ I think it's a big responsibility and a big opportunity as well. Of course we don't consider this as...

Fillers in WFST

March 13, 2011

Another practical question is: how do you integrate fillers? There is a silence class introduced in "A Generalized Construction of Integrated Speech Recognition Transducers" by Cyril Allauzen, Mehryar Mohri, Michael...

Word position context dependency of Sphinxtrain and WFST

March 03, 2011

An interesting thing about Sphinxtrain models is that they use word position as context when looking up a senone for a particular word sequence. That means that in theory a...

Openfst troubleshooting

February 21, 2011

A bit of OpenFst troubleshooting for when you try to build a WFST with Juicer. Say you are running

fstcompose ${OUTLEXBFSM} ${OUTGRAMBFSM} | \
fstepsnormalize | \
fstdeterminize | \
fstencode --encode_labels - $CODEX | \
fstminimize...