8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Attribute-Based Mandarin Speech Recognition Using Conditional Random Fields

Chi-Yueh Lin, Hsiao-Chuan Wang

National Tsing-Hua University, Taiwan

Integrating phonetic knowledge into a speech recognizer is a possible way to further improve the performance of conventional HMM-based speech recognition methods. This paper presents a cascaded architecture which consists of attribute detection and conditional random field to make use of phonetic knowledge within the phone decoding process. The attribute detection can be implemented by using any effective feature extraction approaches. In this study, an HMM-based method is applied for attribute tagging of Mandarin speech. Then a conditional random field method which applies attribute labels as the input vectors is used to perform the speech recognition. The preliminary experiment result shows that the proposed method is very promising and worthy for further investigation.

Full Paper

Bibliographic reference.  Lin, Chi-Yueh / Wang, Hsiao-Chuan (2007): "Attribute-based Mandarin speech recognition using conditional random fields", In INTERSPEECH-2007, 1833-1836.