Sixth European Conference on Speech Communication and Technology
This paper presents our recent efforts in developing a speaker independent LVCSR engine for Mandarin Chinese using our multilingual database GlobalPhone. We describe a two pass approach, in which the recognition first generates phoneme hypotheses and second transform these into Chinese character hypotheses. We show how this approach can reduce complexity and increase flexibility. We evaluate and compare different systems including different base units for speech recognition as phoneme units versus syllables. Furthermore we analyze the influence of tonal information. Our currently bestsystem shows very promising results achieving 15.0 % character error rate.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Reichert, J. / Schultz, Tanja / Waibel, Alex (1999): "Mandarin large vocabulary speech recognition using the globalphone database", In EUROSPEECH'99, 815-818.