ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Exploiting Chinese character models to improve speech recognition performance

J. L. Hieronymus, X. Liu, M. J. F. Gales, P. C. Woodland

The Chinese language is based on characters which are syllabic in nature. Since languages have syllabotactic rules which govern the construction of syllables and their allowed sequences, Chinese character sequence models can be used as a first level approximation of allowed syllable sequences. N-gram character sequence models were trained on 4.3 billion characters. Characters are used as a first level recognition unit with multiple pronunciations per character. For comparison the CU-HTK Mandarin word based system was used to recognize words which were then converted to character sequences. The character only system error rates for one best recognition were slightly worse than word based character recognition. However combining the two systems using log-linear combination gives better results than either system separately. An equally weighted combination gave consistent CER gains of 0.1–0.2% absolute over the word based standard system.

doi: 10.21437/Interspeech.2009-116

Cite as: Hieronymus, J.L., Liu, X., Gales, M.J.F., Woodland, P.C. (2009) Exploiting Chinese character models to improve speech recognition performance. Proc. Interspeech 2009, 364-367, doi: 10.21437/Interspeech.2009-116

  author={J. L. Hieronymus and X. Liu and M. J. F. Gales and P. C. Woodland},
  title={{Exploiting Chinese character models to improve speech recognition performance}},
  booktitle={Proc. Interspeech 2009},