September 22-25, 1997
This paper describes an approach to identifying the reasons that speech recognition errors occur. The algorithm presented requires an accurate word transcript of the utterances being analyzed. It places errors into one of the categories: 1) due to outof vocabulary (OOV) word spoken, 2) search error, 3) homophone substitution, 4) language model overwhelming correct acoustics, 5) transcript/pronunciation problems, 6) confused acoustic models, or 7) miscellaneous/not possible to categorize. Some categorizations of errors can supply training data to automatic corrective training methods that refine acoustic models. Other errors supply language model and lexicon designers with examples that identify potential improvements. The algorithm is described and results on the combined evaluation test sets from 19921995 of the North American Business (NAB)    corpus using the SphinxII recognizer  are presented.
Bibliographic reference. Chase, Lin (1997): "Blame assignment for errors made by large vocabulary speech recognizers", In EUROSPEECH-1997, 1563-1566.