Phoneme Set Design Considering Integrated Acoustic and Linguistic Features of Second Language Speech

Xiaoyun Wang, Tsuneo Kato, Seiichi Yamamoto


Recognition of second language speech is still a challenging task even for state-of-the-art automatic speech recognition (ASR) systems. Considering that second language speech usually includes less fluent pronunciation and mispronunciation even when it is grammatically correct, we propose a novel phonetic decision tree (PDT) method that considers integrated acoustic and linguistic features to derive the phoneme set for second language speech recognition. We verify the efficacy of the proposed method using second language speech collected with a translation-game-type, dialogue-based English CALL system. Experimental results demonstrate that the derived phoneme set achieves higher recognition accuracy than the canonical one.


DOI: 10.21437/Interspeech.2016-663

Cite as

Wang, X., Kato, T., Yamamoto, S. (2016) Phoneme Set Design Considering Integrated Acoustic and Linguistic Features of Second Language Speech. Proc. Interspeech 2016, 3132-3136.

Bibtex
@inproceedings{Wang+2016,
author={Xiaoyun Wang and Tsuneo Kato and Seiichi Yamamoto},
title={Phoneme Set Design Considering Integrated Acoustic and Linguistic Features of Second Language Speech},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-663},
url={http://dx.doi.org/10.21437/Interspeech.2016-663},
pages={3132--3136}
}