Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Pronunciation Variants Description Using Recognition Error Modeling with Phonetic Derivation Hypotheses

Hideharu Nakajima, Yoshinori Sagisaka, Hirofumi Yamamoto

ATR Spoken Language Translation Research Labs., Seika-cho, Soraku-gun, Kyoto, Japan

This paper proposes a new method of generating pronunciation variants to reduce the word error rate in conversational speech recognition. In particular, it focuses on generating alternative pronunciations from canonical forms by using phonological knowledge derived from the analysis of a phonetic transcription corpus. The experimental results show that the pronunciation variants generated by the proposed method provide slightly better performance than a method based on manually written pronunciations. These results also demonstrate the applicability of phonological knowledge-based generation of pronunciation variants.

Keywords: speech variants, multiple pronunciation generation, phonological knowledge, corpus-based approach


Bibliographic reference. Nakajima, Hideharu / Sagisaka, Yoshinori / Yamamoto, Hirofumi (2000): "Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses", in ICSLP-2000, vol. 3, 1093-1096.