10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Simultaneous Estimation of Confidence and Error Cause in Speech Recognition Using Discriminative Model

Atsunori Ogawa, Atsushi Nakamura

NTT Corporation, Japan

Since recognition errors are unavoidable in speech recognition, confidence scoring, which accurately estimates the reliability of recognition results, is a critical function for speech recognition engines. In addition to achieving accurate confidence estimation, if we are to develop speech recognition systems that will be widely used by the public, speech recognition engines must be able to report the causes of errors properly, namely they must offer a reason for any failure to recognize input utterances. This paper proposes a method that simultaneously estimates both confidences and causes of errors in speech recognition results by using discriminative models. We evaluated the proposed method in an initial speech recognition experiment, and confirmed its promising performance with respect to confidence and error cause estimation.

Full Paper

Bibliographic reference.  Ogawa, Atsunori / Nakamura, Atsushi (2009): "Simultaneous estimation of confidence and error cause in speech recognition using discriminative model", In INTERSPEECH-2009, 1199-1202.