11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Improving Back-Off Models with Bag of Words and Hollow-Grams

Benjamin Lecouteux, Raphaël Rubino, Georges Linarès

LIA, France

Classical n-grams models lack robustness on unseen events. The literature suggests several smoothing methods: empirically, the most effective of these is the modified Kneser-Ney approach. We propose to improve this back-off model: our method boils down to back-off value reordering, according to the mutual information of the words, and to a new hollow-gram model. Results show that our back-off model yields significant improvements to the baseline, based on the modified Kneser-Ney back-off. We obtain a 0.6% absolute word error rate improvement without acoustic adaptation, and 0.4% after adaptation with a 3xRT ASR system.

Full Paper

Bibliographic reference.  Lecouteux, Benjamin / Rubino, Raphaël / Linarès, Georges (2010): "Improving back-off models with bag of words and hollow-grams", In INTERSPEECH-2010, 2418-2421.