Two methods of assessing the acoustic modelling of an automatic speech recognition system are presented. The first is objective, and is based on determination of the phone substitution matrix. For this, a generalized alignment procedure is introduced, which leads to better results. The second method is subjective. Hypothesis words, as segmented by the recognizer, are evaluated by test subjects on their validity. Without additional context, humans accept about half of the false alarms found by a word spotter.
Cite as: Leeuwen, D.A.v., Louwere, M.d. (1999) Objective and subjective evaluation of the acoustic models of a continuous speech recognition system. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1915-1918, doi: 10.21437/Eurospeech.1999-420
@inproceedings{leeuwen99_eurospeech, author={David A. van Leeuwen and Michael de Louwere}, title={{Objective and subjective evaluation of the acoustic models of a continuous speech recognition system}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={1915--1918}, doi={10.21437/Eurospeech.1999-420} }