Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Background Model Based Posterior Probability for Measuring Confidence

Peng Liu, Ye Tian, Jian-Lai Zhou, Frank K. Soong

Microsoft Research Asia, Beijing, China

Word posterior probability (WPP) computed over LVCSR word graphs has been used successfully to measure the confidence of speech recognition output. However, for certain applications the word graph is too sparse to support reliable WPP estimation. In this paper, we incorporate subword units as background models to generate a subword graph for estimating posterior probability. Experiments on both English and Chinese databases show that syllable background models can repopulate the dynamic hypothesis space for effective computation of the confidence measure. The resulting posterior probability confidence measure achieves 94.3% and 95.2% out-of-vocabulary (OOV) word detection/rejection rates in English and Chinese, respectively. The corresponding confidence error rates are 6.0% and 6.4%.
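The idea of posterior probability over a hypothesis graph can be illustrated with a minimal sketch. It is not the authors' implementation: the graph layout, the function names (`logsumexp`, `edge_posteriors`), the labels, and the toy scores are all assumptions made for illustration. The sketch runs a standard forward-backward pass over a small acyclic graph and shows that a single word hypothesis with no competitor always receives posterior 1.0, whereas adding hypothetical syllable edges as competing paths yields a more informative value.

```python
import math

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a non-empty list."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))

def edge_posteriors(edges, start, end):
    """Forward-backward edge posteriors on an acyclic graph.

    edges: list of (src, dst, label, log_score), with integer node ids
    numbered in topological order (src < dst). This layout is an
    assumption of the sketch, not something taken from the paper.
    Returns a dict mapping (src, dst, label) to posterior probability.
    """
    nodes = sorted({n for src, dst, _, _ in edges for n in (src, dst)})

    # Forward pass: alpha[n] = log of summed scores of paths start -> n.
    alpha = {n: -math.inf for n in nodes}
    alpha[start] = 0.0
    for n in nodes:
        incoming = [alpha[s] + w for s, d, _, w in edges if d == n]
        if incoming:
            alpha[n] = logsumexp(incoming)

    # Backward pass: beta[n] = log of summed scores of paths n -> end.
    beta = {n: -math.inf for n in nodes}
    beta[end] = 0.0
    for n in reversed(nodes):
        outgoing = [w + beta[d] for s, d, _, w in edges if s == n]
        if outgoing:
            beta[n] = logsumexp(outgoing)

    total = alpha[end]  # log of summed scores of all complete paths
    return {
        (s, d, label): math.exp(alpha[s] + w + beta[d] - total)
        for s, d, label, w in edges
    }

# Toy word graph with a single word hypothesis (made-up score).
word_only = [(0, 2, "seattle", math.log(0.6))]

# Same graph with two hypothetical syllable edges as a competing path.
with_background = word_only + [
    (0, 1, "syl_a", math.log(0.5)),
    (1, 2, "syl_b", math.log(0.8)),
]

sparse = edge_posteriors(word_only, start=0, end=2)
dense = edge_posteriors(with_background, start=0, end=2)
print(sparse[(0, 2, "seattle")])
print(dense[(0, 2, "seattle")])
```

In this toy setting the word's posterior is 1.0 in the sparse graph regardless of its score, and about 0.6 once the syllable path (path score 0.5 × 0.8 = 0.4) competes with it. A confidence decision could then be made by thresholding the posterior; the threshold itself would need to be tuned on held-out data.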

Bibliographic reference. Liu, Peng / Tian, Ye / Zhou, Jian-Lai / Soong, Frank K. (2005): "Background model based posterior probability for measuring confidence", in INTERSPEECH-2005, 1465-1468.