11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Unsupervised Acoustic Model Adaptation for Multi-Origin Non Native ASR

Sethserey Sam (1), Eric Castelli (2), Laurent Besacier (1)

(1) LIG, France
(2) MICA, Vietnam

To date, the performance of speech and language recognition systems is poor on non-native speech. The challenge for non-native speech recognition is to maximize the accuracy of a speech recognition system when only a small amount of non-native data is available. We report on the acoustic model adaptation for improving the recognition of non-native speech in English, French and Vietnamese, spoken by speakers of different origins. Using online unsupervised adaptation acoustic modeling without any additional data for adapting purposes, we investigate how an unsupervised multilingual acoustic model interpolation method can help to improve the phone accuracy of the system. Results improvement of 7% of absolute phone level accuracy (PLA) obtained from the experiments demonstrate the feasibility of the method.

Full Paper

Bibliographic reference.  Sam, Sethserey / Castelli, Eric / Besacier, Laurent (2010): "Unsupervised acoustic model adaptation for multi-origin non native ASR", In INTERSPEECH-2010, 254-257.