9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Nonnative Speech Recognition Based on State-Candidate Bilingual Model Modification

Qingqing Zhang, Ta Li, Jielin Pan, Yonghong Yan

Chinese Academy of Sciences, China

The speech recognition accuracy has been observed to decrease for nonnative speakers, especially those who are just beginning to learn foreign language or who have heavy accents. This paper presents a novel bilingual model modification approach to improve nonnative speech recognition, considering these great variations of accented pronunciations. Each state of the baseline nonnative acoustic models is modified with several candidate states from the auxiliary acoustic models, which are trained by speakers' mother language. State mapping criterion and n-best candidates are investigated based on a grammar-constrained speech recognition system. Using the state-candidate bilingual model modification approach, compared to the nonnative acoustic models which have already been well trained by adaptation technique MAP, a Relative reduction of 7.87% in Phrase Error Rate (RPhrER) was further achieved.

Full Paper

Bibliographic reference.  Zhang, Qingqing / Li, Ta / Pan, Jielin / Yan, Yonghong (2008): "Nonnative speech recognition based on state-candidate bilingual model modification", In INTERSPEECH-2008, 2366-2369.