This paper describes a frame-period adaptation method for speaking rate robust speech recognition. The proposed method determines an appropriate frame-period for each phrase by measuring its speaking rate or computing the acoustic likelihood with a set of frame-periods. Experimental results on spontaneous speech recognition show that the proposed method is effective for slower utterance. Actually, we can get about a 15% error reduction in error rate for slower utterance by using the likelihood based frame-period determination.
Cite as: Tsuge, S., Fukada, T., Kita, K. (2000) Frame-period adaptation for speaking rate robust speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 718-721, doi: 10.21437/ICSLP.2000-636
@inproceedings{tsuge00_icslp, author={Satoru Tsuge and Toshiaki Fukada and Kenji Kita}, title={{Frame-period adaptation for speaking rate robust speech recognition}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 718-721}, doi={10.21437/ICSLP.2000-636} }