First International Conference on Spoken Language Processing (ICSLP 90)
SRI is developing a system that uses real time speech recognition to diagnose, evaluate and provide training in spoken English. The paper first describes the methods and results of a study of the feasibility of automatically grading the performance of Japanese students when reading English aloud. Utterances recorded from Japanese speakers were independently rated by expert listeners. Speech grading software was developed from a speaker independent hidden-Markov-model speech recognition system. The automatic grading procedure first aligned the speech with a model and then compared the segments of the speech signal with models of those segments that have been developed from a database of speech from native speakers of English. The evaluation study showed that ratings of speech quality by experts are very reliable and automatic grades correlate well (r > 0.8) with those expert ratings. SRI is now extending this technology and integrating it in a spoken-language training system. This effort involves (1) porting SRI's DECIPHER speech recognition system to a microcomputer platform, and (2) extending the speech-evaluation software to more exactly diagnose a learner's pronunciation deficits and lead the learner through an appropriate regimen of exercises.
Bibliographic reference. Bernstein, Jared / Cohen, Michael / Murveit, Hy / Rtischev, Dimitry / Weintraub, Mitchel (1990): "Automatic evaluation and training in English pronunciation", In ICSLP-1990, 1185-1188.