Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

Mandarin Large Vocabulary Speech Recognition Using the GlobalPhone Database

J. Reichert, Tanja Schultz, Alex Waibel

Interactive Systems Laboratories, University of Karlsruhe, Germany, and
Carnegie Mellon University, Pittsburgh, PA, USA

This paper presents our recent efforts in developing a speaker independent LVCSR engine for Mandarin Chinese using our multilingual database GlobalPhone. We describe a two pass approach, in which the recognition first generates phoneme hypotheses and second transform these into Chinese character hypotheses. We show how this approach can reduce complexity and increase flexibility. We evaluate and compare different systems including different base units for speech recognition as phoneme units versus syllables. Furthermore we analyze the influence of tonal information. Our currently bestsystem shows very promising results achieving 15.0 % character error rate.


Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Reichert, J. / Schultz, Tanja / Waibel, Alex (1999): "Mandarin large vocabulary speech recognition using the globalphone database", In EUROSPEECH'99, 815-818.