COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction

University of East Anglia, Norwich, UK
August 30-31, 2004

Task Independent Speech Verification Using SB-MVE Trained Phone Models

Svein G. Pettersen, Magne H. Johnsen, Tor A. Myrvoll

Department of Electronics and Telecommunication, NTNU, Norway

Robust ASR-systems should benefit from detecting when portions of the decoded hypotheses are incorrect. This can be done by including a separate verification module based on statistical hypothesis testing. String based minimum verification error (SB-MVE) training is a promising alternative for improving the corresponding verification-models.

This paper adresses a variant of SB-MVE at the phone level for design of task independent verification modules. The algorithm updates both H0 and H1 phone models. Experiments are performed on "time of day" recordings of the Norwegian part of Speechdat (II). The results show a relative decrease in utterance error rate (compared to no verification) from 8 - 37% for false rejection rates ranging from 0 - 25%. Thus the method shows robustness with respect to choice of treshold.


Full Paper

Bibliographic reference.  Pettersen, Svein G. / Johnsen, Magne H. / Myrvoll, Tor A. (2004): "Task independent speech verification using SB-MVE trained phone models", In Robust2004, paper 10.