8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Combined Acoustic and Pronunciation Modelling for Non-Native Speech Recognition

G. Bouselmi, Dominique Fohr, I. Illina

LORIA, France

In this paper, we present several adaptation methods for non-native speech recognition. We have tested pronunciation modelling, MLLR and MAP non-native pronunciation adaptation and HMM models retraining on the HIWIRE foreign accented English speech database. The "phonetic confusion" scheme we have developed consists in associating to each spoken phone several sequences of confused phones. In our experiments, we have used different combinations of acoustic models representing the canonical and the foreign pronunciations: spoken and native models, models adapted to the non-native accent with MAP and MLLR. The joint use of pronunciation modelling and acoustic adaptation led to further improvements in recognition accuracy. The best combination of the above mentioned techniques resulted in a relative word error reduction ranging from 46% to 71%.

Full Paper

Bibliographic reference.  Bouselmi, G. / Fohr, Dominique / Illina, I. (2007): "Combined acoustic and pronunciation modelling for non-native speech recognition", In INTERSPEECH-2007, 1449-1452.