ISCA Archive Interspeech 2008

Mispronunciation detection for Mandarin Chinese

Chao Huang, Feng Zhang, Frank K. Soong, Min Chu

In this paper, we propose several reliable weighting factors based on the speaker's proficiency level, which can be used to normalize the scaled log-posterior probability (SLPP) and further improve syllable-level mispronunciation detection for Mandarin Chinese. Experiments on a database of 8000 syllables, pronounced by 40 speakers with varied pronunciation proficiency, show that these normalization schemes are effective: they reduce FAR from 44.4% to 35.1% on average and greatly improve automatic mispronunciation detection (AMD) performance. In addition, we investigate and analyze the underlying behavior of these normalization factors. Some modifications, extensions and possible applications of such factors in real usage cases are also discussed.

doi: 10.21437/Interspeech.2008-658

Cite as: Huang, C., Zhang, F., Soong, F.K., Chu, M. (2008) Mispronunciation detection for Mandarin Chinese. Proc. Interspeech 2008, 2655-2658, doi: 10.21437/Interspeech.2008-658

  author={Chao Huang and Feng Zhang and Frank K. Soong and Min Chu},
  title={{Mispronunciation detection for Mandarin Chinese}},
  booktitle={Proc. Interspeech 2008},