ISCA Archive Interspeech 2005

A statistical method of evaluating pronunciation proficiency for Japanese words

Kei Ohta, Seiichi Nakagawa

In this paper, we propose a statistical method of evaluating the pronunciation proficiency of Japanese words. We statistically analyze the utterances to find a combination of acoustic features that correlates highly with a Japanese teacher's score. We found that the syllable recognition rate (accuracy) was the best measure of pronunciation proficiency. The effective measure, which was highly correlated with the Japanese teacher's score, was the combination of the a posteriori probability, the substitution/accuracy rates, and the standard deviation of mora lengths. At the level of a five-word set per speaker, we obtained a correlation coefficient of 0.712 with closed data and 0.591 with open data. The coefficient was close to the correlation between human raters' scores, 0.600.


doi: 10.21437/Interspeech.2005-358

Cite as: Ohta, K., Nakagawa, S. (2005) A statistical method of evaluating pronunciation proficiency for Japanese words. Proc. Interspeech 2005, 2233-2236, doi: 10.21437/Interspeech.2005-358

@inproceedings{ohta05_interspeech,
  author={Kei Ohta and Seiichi Nakagawa},
  title={{A statistical method of evaluating pronunciation proficiency for Japanese words}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2233--2236},
  doi={10.21437/Interspeech.2005-358}
}