September 22-25, 1997
This paper describes three experiments in using frame level observation probabilities as the basis for word confidence annotation in an HMM speech recognition system. One experiment is at the word level, one uses word classes, and the other uses phone classes. In each experiment we categorize hypotheses into correct and incorrect categories by aligning a best recognition hypothesis with the known transcript. The confidence of error prediction for each class is a measure of the resolvability between the correct and incorrect histograms.
Bibliographic reference. Bergen, Zachary / Ward, Wayne (1997): "A senone based confidence measure for speech recognition", In EUROSPEECH-1997, 819-822.