This paper proposes a new method of pronunciation variant generation for reducing word error rate in conversational speech recognition. In particular, this paper focuses on the generation of alternative pronunciations from canonical forms by using the phonological knowledge derived from the analysis of a phonetic transcription corpus. The experimental results show that the pronunciation variation generated by the proposed method provides slightly better performance than a method based on manually written pronunciation. These results also demonstrate the applicability of phonological knowledge-based generation of pronunciation variation.
Keywords: speech variants, multiple pronunciation generation, phonological knowledge, corpus based approach
Cite as: Nakajima, H., Sagisaka, Y., Yamamoto, H. (2000) Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 1093-1096, doi: 10.21437/ICSLP.2000-726
@inproceedings{nakajima00_icslp, author={Hideharu Nakajima and Yoshinori Sagisaka and Hirofumi Yamamoto}, title={{Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 1093-1096}, doi={10.21437/ICSLP.2000-726} }