ISCA Archive Interspeech 2008

Forward optimal modeling of acoustic confusions in Mandarin CALL system

Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, Yonghong Yan

Acoustic confusions severely degrade the accuracy of pronunciation assessment in Computer Assisted Language Learning (CALL) systems. This paper presents our recent study on optimal modeling of these confusions. We replace the traditional Mandarin syllable structure, which is composed of an initial and a final, with a novel phoneme structure. We investigate several phoneme splitting strategies and study the question list used for building and merging the decision tree; the questions are specific to each phoneme splitting strategy. Experiments show that the optimal phoneme splitting strategy outperforms the traditional initial-final structure in our CALL system, with a relative ASER improvement of 11.05% for nasal finals. This idea may be extended to improve the performance of automatic speech recognition (ASR).
doi: 10.21437/Interspeech.2008-478

Cite as: Ge, F., Pan, F., Liu, C., Dong, B., Yan, Y. (2008) Forward optimal modeling of acoustic confusions in Mandarin CALL system. Proc. Interspeech 2008, 2815-2818, doi: 10.21437/Interspeech.2008-478

@inproceedings{ge08_interspeech,
  author={Fengpei Ge and Fuping Pan and Changliang Liu and Bin Dong and Yonghong Yan},
  title={{Forward optimal modeling of acoustic confusions in Mandarin CALL system}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2815--2818},
  doi={10.21437/Interspeech.2008-478}
}