Speech recognition of foreign accented speech is one of the most difficult tasks in ASR. The problem of foreign accent is addressed in this study using acoustic models of the target language phonemes (French phonemes in our case) adapted with speech data from 3 other languages: English (US and UK), German and Spanish. Recognition results obtained for 11 language groups of speakers show that error rate can be significantly reduced when standard acoustic models of phonemes are adapted using speech data from other languages. Phonological rules are also introduced into the standard phonetic description of the lexical units to account for some foreign accent pronunciation variants. It appears that using phonological rules together with foreign language adapted acoustic units provides the best recognition performance. The highest error rate reduction (40%) is obtained on English speakers.
Cite as: Bartkova, K., Jouvet, D. (2004) Foreign accent processing in automatic speech recognition. Proc. 9th Conference on Speech and Computer (SPECOM 2004), 22-28
@inproceedings{bartkova04_specom, author={Katarina Bartkova and Denis Jouvet}, title={{Foreign accent processing in automatic speech recognition}}, year=2004, booktitle={Proc. 9th Conference on Speech and Computer (SPECOM 2004)}, pages={22--28} }