11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Improvements of Search Error Risk Minimization in Viterbi Beam Search for Speech Recognition

Takaaki Hori, Shinji Watanabe, Atsushi Nakamura

NTT Corporation, Japan

This paper describes improvements in a search error risk minimization approach to fast beam search for speech recognition. In our previous work, we proposed this approach to reduce search errors by optimizing the pruning criterion. While conventional methods use heuristic criteria to prune hypotheses, our proposed method employs a pruning function that makes a more precise decision using rich features extracted from each hypothesis. The parameters of the function can be estimated to minimize a loss function based on the search error risk. In this paper, we improve this method by introducing a modified loss function, arc-averaged risk, which potentially has a higher correlation with actual error rate than the original one. We also investigate various combinations of features. Experimental results show that further search error reduction over the original method is obtained in a 100K-word vocabulary lecture speech transcription task.

Full Paper

Bibliographic reference.  Hori, Takaaki / Watanabe, Shinji / Nakamura, Atsushi (2010): "Improvements of search error risk minimization in viterbi beam search for speech recognition", In INTERSPEECH-2010, 1962-1965.