In this paper, we describe the design of an automatic speech recognition (ASR) system that identifies and extracts formulaic phrases from a corpus and then, rather than building statistical models of them, recognises these phrases by example-based matching. We describe a method for combining formulaic phrases into a bigram language model that yields a 13% decrease in word error rate (WER) over the baseline on a monophone hidden Markov model (HMM) recogniser. We show that using this model with phrase templates in the example-based recogniser gives a significant improvement in WER compared to word templates, although performance still falls short of the HMM recogniser. We also describe an LDA decision tree classifier that reduces the search space of the dynamic time warping (DTW) decoder by 40% while also decreasing WER.
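As a rough illustration of what example-based recognition with a DTW decoder involves, the sketch below matches an utterance against stored templates and returns the label of the closest one. It is a minimal textbook DTW under stated assumptions, not the system described in the paper: the function names `dtw_distance` and `recognise`, the Euclidean frame distance, the unconstrained warping path, and the one-dimensional toy features are all hypothetical choices made for this example.

```python
import math

def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Assumes a Euclidean local distance and the basic step pattern
    (insertion, deletion, match) with no path constraints.
    """
    n, m = len(a), len(b)
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # stretch template frame
                cost[i][j - 1],      # stretch utterance frame
                cost[i - 1][j - 1],  # advance both
            )
    return cost[n][m]

def recognise(utterance, templates):
    """Return the label whose closest stored example has the lowest DTW cost.

    `templates` maps a label (a word or a phrase) to a list of examples,
    each example being a sequence of feature vectors.
    """
    return min(
        templates,
        key=lambda label: min(dtw_distance(utterance, t) for t in templates[label]),
    )

# Hypothetical toy data: one-dimensional "features" standing in for real
# acoustic feature vectors.
templates = {
    "thank you": [[(0.0,), (1.0,), (2.0,)]],
    "good morning": [[(5.0,), (5.0,), (4.0,)]],
}
utterance = [(0.0,), (0.0,), (1.0,), (2.0,)]
best = recognise(utterance, templates)
```

Because every template must be compared against the input, the cost of this exhaustive search grows with the number of stored examples, which is why pruning the set of candidate templates before DTW matching can be worthwhile.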
Cite as: Watkins, C.J., Cox, S.J. (2009) Example-based speech recognition using formulaic phrases. Proc. Interspeech 2009, 3043-3046, doi: 10.21437/Interspeech.2009-564
@inproceedings{watkins09_interspeech,
  author={Christopher J. Watkins and Stephen J. Cox},
  title={{Example-based speech recognition using formulaic phrases}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={3043--3046},
  doi={10.21437/Interspeech.2009-564}
}