INTERSPEECH 2006 - ICSLP
This paper describes our efforts in building a competitive Mandarin broadcast news speech recognizer. We successfully incorporated the most popular speech technologies into our system. More importantly, we present two novel algorithms in smoothing pitch features and segmenting Chinese characters into word units. Additionally, we propose to borrow the principle of pointwise mutual information for creating a Chinese word lexicon automatically. Our final system achieved 6.0% character error rate (CER) on dev04 and 16.0% on eval04, with simpler acoustic models, less training data, and simpler decoding architecture compared with other state-of-the-art systems, yet was equally competitive.
Bibliographic reference. Hwang, Mei-Yuh / Lei, Xin / Wang, Wen / Shinozaki, Takahiro (2006): "Investigation on Mandarin broadcast news speech recognition", In INTERSPEECH-2006, paper 1916-Tue3A2O.3.