Acoustic models built only from read speech reach a limit in recognition performance on dialogue speech, because dialogue speech exhibits various acoustic and linguistic phenomena characteristic of spontaneous speech. In this paper, we investigated the differences in acoustic properties among isolated words, read speech, and spontaneous speech that cause this limit. First, the dialogue speech was compared with the read speech through acoustic analyses. Next, acoustic models were built separately from each of the speech databases. Recognition performance was evaluated experimentally using these acoustic models, and the relationship between the differences in performance and the differences in the acoustic features observed in the analyses was investigated quantitatively. The effectiveness of speaker adaptation was also investigated in the same manner.
Cite as: Yamamoto, K., Nakagawa, S. (2000) Relationship among speaking style, inter-phoneme's distance and speech recognition performance. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 859-862, doi: 10.21437/ICSLP.2000-405
@inproceedings{yamamoto00c_icslp, author={Kazumasa Yamamoto and Seiichi Nakagawa}, title={{Relationship among speaking style, inter-phoneme's distance and speech recognition performance}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 2, 859-862}, doi={10.21437/ICSLP.2000-405} }