Context Aware Mispronunciation Detection for Mandarin Pronunciation Training

Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li


Mispronunciation detection is an important component in a computer-assisted language learning (CALL) system. Many CALL systems only provide pronunciation correctness as the single feedback, which is not very informative for language learners. This paper proposes a context aware multilayer framework for Mandarin mispronunciation detection. The proposed framework incorporates the context information in the detection process and providing phonetic, tonal and syllabic level feedback. In particular, the contribution of this work is twofold: 1) we propose to use a multilayer mispronunciation detection architecture to detect and provide mispronunciation feedback at the phonetic, tonal and syllabic levels. 2) we propose to incorporate the phonetic and tone context information in mispronunciation detection using vector space modelling. Our experiment results show that the proposed framework improves the mispronunciation detection performance in all three levels.


DOI: 10.21437/Interspeech.2016-289

Cite as

Tong, R., Chen, N.F., Ma, B., Li, H. (2016) Context Aware Mispronunciation Detection for Mandarin Pronunciation Training. Proc. Interspeech 2016, 3112-3116.

Bibtex
@inproceedings{Tong+2016,
author={Rong Tong and Nancy F. Chen and Bin Ma and Haizhou Li},
title={Context Aware Mispronunciation Detection for Mandarin Pronunciation Training},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-289},
url={http://dx.doi.org/10.21437/Interspeech.2016-289},
pages={3112--3116}
}