11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Automatic Pronunciation Scoring Using Learning to Rank and DP-Based Score Segmentation

Liang-Yu Chen, Jyh-Shing Roger Jang

National Tsing Hua University, Taiwan

This paper proposes a novel automatic pronunciation scoring framework using learning to rank. Human scores of the utterances are treated as ranks and are used as the ranking ground truths. Scores generated from various existing scoring methods are used as the features to train the learning to rank function. The output of the function is then segmented by the proposed DP-based method and hence boundaries between clusters can be used to determine the discrete computer scores. Experimental results show that the proposed framework improves upon the existing scoring methods. A non-native corpus with human ranks is also released.

Full Paper

Bibliographic reference.  Chen, Liang-Yu / Jang, Jyh-Shing Roger (2010): "Automatic pronunciation scoring using learning to rank and DP-based score segmentation", In INTERSPEECH-2010, 761-764.