Pronunciation scoring is one important task for software designed to give feedback to students practicing a second language. English intonation can convey information about a speaker's nativeness, so previous studies have proposed using intonation-based models to score nonnative pronunciation. One past approach trained models for a set of pronunciation scores using ad hoc features derived from the frequency contour. We use prosodic theory to train models for categorical intonation units, inspired by work in modeling tone languages. These HMM-based models offer 0.398 correlation between automatic and listener scores on the ISLE nonnative speech corpus, compared to the 0.156 baseline correlation.
Bibliographic reference. Tepperman, Joseph / Narayanan, Shrikanth S. (2008): "Better nonnative intonation scores through prosodic theory", In INTERSPEECH-2008, 1813-1816.