ISCA Archive SLaTE 2019
ISCA Archive SLaTE 2019

Noise robust goodness of pronunciation measures using teacher's utterance

Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, Prasanta Kumar Ghosh

In the applications of computer-aided pronunciation training (CAPT), evaluation of second language learner's pronunciation is an important task. For this task, goodness of pronunciation (GoP) is shown to be effective and is typically computed under clean speech conditions. However, in real scenarios, CAPT systems often need to deal with noisy conditions, which could degrade the effectiveness of GoP. We analyze the variations in GoP performance under noisy conditions by adding three types of noises namely, babble, white and f-16 at 20 dB, 10 dB and 0 dB signal-to-noise ratio (SNR) conditions. We hypothesize that the use of phonemes uttered by a teacher would make GoP score more robust and mimic the human rating closely, based on which we propose a modification to the typical lexicon based GoP (LGoP). The proposed scheme is referred as teacher utterance based GoP (TGoP). In addition, GoP of learner's and teacher's utterances are combined to propose a GoP like (GL) score based on the difference between the two. Correlation coefficient between the GoPs and the teacher's ratings is used as the performance metric. Experiments conducted on the speech data collected from Indian English learners reveal that, although the performance of different GoP schemes drops with additive noise, TGoP performs better than LGoP in both clean and noisy conditions. In low SNR conditions, GL performs better than both TGoP and LGoP.


doi: 10.21437/SLaTE.2019-13

Cite as: Sudhakara, S., Ramanathi, M.K., Yarra, C., Das, A., Ghosh, P.K. (2019) Noise robust goodness of pronunciation measures using teacher's utterance. Proc. 8th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2019), 69-73, doi: 10.21437/SLaTE.2019-13

@inproceedings{sudhakara19_slate,
  author={Sweekar Sudhakara and Manoj Kumar Ramanathi and Chiranjeevi Yarra and Anurag Das and Prasanta Kumar Ghosh},
  title={{Noise robust goodness of pronunciation measures using teacher's utterance}},
  year=2019,
  booktitle={Proc. 8th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2019)},
  pages={69--73},
  doi={10.21437/SLaTE.2019-13}
}