Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Investigation on Mandarin Broadcast News Speech Recognition

Mei-Yuh Hwang (1), Xin Lei (1), Wen Wang (2), Takahiro Shinozaki (1)

(1) University of Washington, USA; (2) SRI International, USA

This paper describes our efforts to build a competitive Mandarin broadcast news speech recognizer. We incorporated the most widely used speech technologies into our system. More importantly, we present two novel algorithms: one for smoothing pitch features and one for segmenting Chinese character sequences into word units. Additionally, we propose borrowing the principle of pointwise mutual information to create a Chinese word lexicon automatically. Our final system achieved a 6.0% character error rate (CER) on dev04 and 16.0% on eval04. It was competitive with other state-of-the-art systems while using simpler acoustic models, less training data, and a simpler decoding architecture.
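
The abstract does not say how pointwise mutual information (PMI) is applied to lexicon building, so the following is only a generic sketch of the underlying idea, not the authors' algorithm. It scores each adjacent character pair by PMI(a, b) = log2(P(a, b) / (P(a) P(b))), on the assumption that pairs with high PMI are better candidates for lexicon entries. The function name `adjacent_pair_pmi` and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def adjacent_pair_pmi(sentences):
    """Return a dict mapping each adjacent character pair to its PMI (base 2).

    Illustrative assumption: probabilities are plain relative frequencies
    over the given sentences, with no smoothing or frequency cutoff.
    """
    char_counts = Counter()
    pair_counts = Counter()
    for sentence in sentences:
        char_counts.update(sentence)
        pair_counts.update(zip(sentence, sentence[1:]))

    n_chars = sum(char_counts.values())
    n_pairs = sum(pair_counts.values())

    pmi = {}
    for (a, b), count in pair_counts.items():
        p_ab = count / n_pairs
        p_a = char_counts[a] / n_chars
        p_b = char_counts[b] / n_chars
        pmi[a + b] = math.log2(p_ab / (p_a * p_b))
    return pmi


# Toy corpus (hypothetical): three unsegmented character strings.
corpus = ["北京新闻", "北京天气", "新闻天气"]
scores = adjacent_pair_pmi(corpus)

# Pairs that recur together (e.g. 北京) score higher than pairs that
# merely straddle a word boundary (e.g. 京新).
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 2))
```

In a real system the scores would need a frequency threshold and smoothing, since PMI is known to overrate rare pairs; those choices are left open here.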

Bibliographic reference. Hwang, Mei-Yuh / Lei, Xin / Wang, Wen / Shinozaki, Takahiro (2006): "Investigation on Mandarin broadcast news speech recognition", in INTERSPEECH-2006, paper 1916-Tue3A2O.3.