10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Optimizing Non-Native Speech Recognition for CALL Applications

Joost van Doremalen, Helmer Strik, Catia Cucchiarini

Radboud Universiteit Nijmegen, The Netherlands

We are developing a Computer Assisted Language Learning (CALL) system for practicing oral proficiency that makes use of Automatic Speech Recognition (ASR) to provide feedback on grammar and pronunciation. Since good quality unconstrained non-native ASR is not yet feasible, we use an approach in which we try to elicit constrained responses. The task in the current experiments is to select utterances from a list of responses. The results of our experiments show that significant improvements can be obtained by optimizing the language model and the acoustic models, thus reducing the utterance error rate from 2926% to 108%.

Full Paper

Bibliographic reference.  Doremalen, Joost van / Strik, Helmer / Cucchiarini, Catia (2009): "Optimizing non-native speech recognition for CALL applications", In INTERSPEECH-2009, 592-595.