INTERSPEECH 2004 - ICSLP
This paper presents an initial effort in the area of non-native children's speech recognition by exploiting two children databases, one consisting of speech collected from native English children, the other one consisting of English sentences read by Italian learners of English in the same age range of the native speakers. First, a baseline speech recognizer for British English was trained on the corpus of native speech and applied to recognize native and non-native speech. Word error rates achieved for Italian children were 100%-600% higher than those achieved for native English children of the same age. By using a small amount of non-native speech from a group of Italian learners of English, acoustic models were adapted to this particular category of speakers. Adaptation of both context-independent and context-dependent HMMs showed to greatly improve recognition performance on non-native speech, obtaining relative reductions in word error rate up to 66.2%.
Bibliographic reference. Gerosa, Matteo / Giuliani, Diego (2004): "Investigating automatic recognition of non-native children's speech", In INTERSPEECH-2004, 1521-1524.