Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Improvements of the Philips 2000 Taiwan Mandarin Benchmark System

Yuan-Fu Liao (1), Nick Wang (1,2), Max Huang (1), Hank Huang (1), Frank Seide (1)

(1) Philips Research East-Asia, Taipei, Taiwan
(2) Graduate Institute of Communication Engineering, Taiwan University, Taiwan

In this paper, we present the Philips large vocabulary continuous Mandarin speech recognition system developed for the 2000 Taiwan Speech Input Technology Assessment. We systematically integrated key Mandarin components with up-todate Western-language techniques to build up a state-of-the-art Mandarin speech recognition system. These technologies include robust pitch extraction/tone modeling, context-dependent preme/core-final units, Chinese phrase/syllable trigram language model, linear discriminant analysis (LDA), cross-syllable modeling/decoding, speaker clustering and maximum likelihood linear regression (MLLR) adaptation. Among them, the major breakthroughs were our robust pitch extraction/tone modeling technology and the treatment of coarticulation across syllable boundaries. For the development set, we dramatically reduced last year’s best error rates by relative 44.8%~67.8% on all three categories we participated. Moreover, for the evaluation set, we achieved the lowest unit error rates on all three categories.


Full Paper

Bibliographic reference.  Liao, Yuan-Fu / Wang, Nick / Huang, Max / Huang, Hank / Seide, Frank (2000): "Improvements of the Philips 2000 Taiwan Mandarin benchmark system", In ICSLP-2000, vol.4, 298-301.