INTERSPEECH 2010
11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Spoken English Assessment System for Non-Native Speakers Using Acoustic and Prosodic Features

Qin Shi (1), Kun Li (2), ShiLei Zhang (1), Stephen M. Chu (3), Ji Xiao (2), ZhiJian Ou (2)

(1) IBM Research, China
(2) Tsinghua University, China
(3) IBM T.J. Watson Research Center, USA

The absence of real-time and targeted feedback is often critical in spoken foreign language learning. Computer-assisted language assessment systems are playing an ever more important role in this domain. This work considers the idiosyncratic pronunciation patterns of Chinese English speakers and uses both acoustic and prosody features to capture pronunciation, word stress, and rhythm information. The proposed system uses a. automatic speech recognition and alignment for pronunciation assessment, b. a set of special features with appropriate normalization for word stress detection, and c. a prosody phrase prediction model for rhythm assessment; and is shown to give immediate and accurate analyses to speakers to improve learning efficiency.

Full Paper

Bibliographic reference.  Shi, Qin / Li, Kun / Zhang, ShiLei / Chu, Stephen M. / Xiao, Ji / Ou, ZhiJian (2010): "Spoken English assessment system for non-native speakers using acoustic and prosodic features", In INTERSPEECH-2010, 1874-1877.