ISCA Archive Eurospeech 2001

Is non-native pronunciation modelling necessary ?

Silke Goronzy, Marina Sahakyan, Wolfgang Wokurek

Non-native speech is difficult to recognize with speech recognition systems that are trained on native speech. Standard speaker adaptation techniques are often used in these cases, but they cannot handle severe deviations from the expected pronunciation. There has also been a lot of recent interest in native pronunciation modelling, although its results were often not as good as expected. This paper investigates whether special treatment of non-native speakers is necessary by examining the effect of adding special pronunciation variants to the lexicon. In contrast to native pronunciation modelling, the results show that in the non-native case the enhanced dictionary is really necessary to obtain acceptable recognition rates. Recognition rates can be improved by up to 10% for German and even up to 28% for Italian learners of English. Combining the enhanced dictionary with MLLR adaptation improves these results further.
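The idea of an enhanced dictionary can be illustrated with a minimal sketch, assuming the lexicon is a simple mapping from words to lists of phoneme sequences. The function name `add_variants`, the ARPAbet-style phoneme symbols, and the example entries are all hypothetical and are not taken from the paper; the abstract does not state how the variants were actually derived or stored.

```python
def add_variants(lexicon, variants):
    """Return a new lexicon with extra pronunciation variants merged in.

    Both arguments map a word to a list of phoneme tuples. The input
    lexicon is left unchanged, and duplicate pronunciations are skipped.
    """
    enhanced = {word: list(prons) for word, prons in lexicon.items()}
    for word, prons in variants.items():
        entry = enhanced.setdefault(word, [])
        for pron in prons:
            if pron not in entry:
                entry.append(pron)
    return enhanced


# Hypothetical native (canonical) entries, ARPAbet-style symbols assumed.
native_lexicon = {
    "this": [("dh", "ih", "s")],
    "three": [("th", "r", "iy")],
}

# Hypothetical non-native variants, e.g. substitutions for dental fricatives.
non_native_variants = {
    "this": [("d", "ih", "s"), ("z", "ih", "s")],
    "three": [("s", "r", "iy")],
}

enhanced = add_variants(native_lexicon, non_native_variants)
for word, prons in enhanced.items():
    print(word, [" ".join(p) for p in prons])
```

In this sketch the canonical pronunciation stays first in each entry, so a recognizer using the enhanced lexicon could still match native speakers while also accepting the added variants.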

doi: 10.21437/Eurospeech.2001-98

Cite as: Goronzy, S., Sahakyan, M., Wokurek, W. (2001) Is non-native pronunciation modelling necessary ? Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 309-312, doi: 10.21437/Eurospeech.2001-98

  author={Silke Goronzy and Marina Sahakyan and Wolfgang Wokurek},
  title={{Is non-native pronunciation modelling necessary ?}},
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},