Written by Nickolay Shmyrev
Recent issues
Heh, this month I discovered a few critical issues in CMUSphinx:
- Pocketsphinx doesn't properly decode short silences in FSG/JSGF mode
- Sphinx4 doesn't really work with an OOV loop in the grammar
- Pocketsphinx n-best lists are useless because of too many repeated entries
- Pocketsphinx accuracy is way lower than that of sphinx3
- Supposedly-working sphinxbase LM stuff doesn't work with 32-bit DMP, thus no MMIE training for very large vocabulary
- MMIE itself doesn't improve accuracy (tested on Voxforge and Fisher)
- It's impossible to extract mixture_weights from recent sendumps in pocketsphinx models; the Python scripts in SphinxTrain are outdated
- PTM model adaptation doesn't work
- TextAligner demo from sphinx4 requires way more work to align properly
This is getting crazy. I wonder if I'll be able to find the time to fix all that.