ISCA Archive Interspeech 2009

Simultaneous estimation of confidence and error cause in speech recognition using discriminative model

Atsunori Ogawa, Atsushi Nakamura

Since recognition errors are unavoidable in speech recognition, confidence scoring, which accurately estimates the reliability of recognition results, is a critical function of speech recognition engines. Beyond accurate confidence estimation, speech recognition systems intended for wide public use must also report the causes of errors properly; that is, they must offer a reason for any failure to recognize an input utterance. This paper proposes a method that uses discriminative models to estimate both the confidence and the error causes of speech recognition results simultaneously. We evaluated the proposed method in an initial speech recognition experiment and confirmed its promising performance in both confidence estimation and error cause estimation.


doi: 10.21437/Interspeech.2009-347

Cite as: Ogawa, A., Nakamura, A. (2009) Simultaneous estimation of confidence and error cause in speech recognition using discriminative model. Proc. Interspeech 2009, 1199-1202, doi: 10.21437/Interspeech.2009-347

@inproceedings{ogawa09_interspeech,
  author={Atsunori Ogawa and Atsushi Nakamura},
  title={{Simultaneous estimation of confidence and error cause in speech recognition using discriminative model}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1199--1202},
  doi={10.21437/Interspeech.2009-347}
}