ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Improving pronunciation modeling for non-native speech recognition

Tien-Ping Tan, Laurent Besacier

In this paper, three different approaches to pronunciation modeling are investigated. Two existing pronunciation modeling approaches, namely the pronunciation dictionary and n-best rescoring approach are modified to work with little amount of non-native speech. We also propose a speaker clustering approach, which capable of grouping the speakers based on their pronunciation habits. Given some speech, the approach can also be used for pronunciation adaptation. This approach is called latent pronunciation analysis. The results show that conventional pronunciation dictionary perform slightly better than n-best list rescoring, while the latent pronunciation analysis has shown to be beneficial for speaker clustering, and it can produce nearly the same improvement as the pronunciation dictionary approach, without the need to know the origin of the speaker.

doi: 10.21437/Interspeech.2008-495

Cite as: Tan, T.-P., Besacier, L. (2008) Improving pronunciation modeling for non-native speech recognition. Proc. Interspeech 2008, 1801-1804, doi: 10.21437/Interspeech.2008-495

  author={Tien-Ping Tan and Laurent Besacier},
  title={{Improving pronunciation modeling for non-native speech recognition}},
  booktitle={Proc. Interspeech 2008},