INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Automatic Error Recovery for Pronunciation Dictionaries

Tim Schlippe, Sebastian Ochs, Ngoc Thang Vu, Tanja Schultz

Cognitive Systems Lab, Karlsruhe Institute of Technology (KIT), Germany

In this paper, we present our latest investigations on pronunciation modeling and its impact on ASR. We propose completely automatic methods to detect, remove, and substitute inconsistent or flawed entries in pronunciation dictionaries. The experiments were conducted on different tasks, namely (1) word-pronunciation pairs from the Czech, English, French, German, Polish, and Spanish Wiktionary [1], a multilingual wiki-based open content dictionary, (2) our GlobalPhone Hausa pronunciation dictionary [2], and (3) pronunciations to complement our Mandarin-English SEAME code-switch dictionary [3]. In the final results, we fairly observed on average an improvement of 2.0% relative in terms of word error rate and even 27.3% for the case of English Wiktionary word-pronunciation pairs.

Index Terms: pronunciation dictionaries, automatic error recovery, multilingual speech recognition

References

  1. “Wiktionary - a wiki-based open content dictionary”, Website, http://www.wiktionary.org.
  2. Schlippe, T., Komgang Djomgang, E. G., Vu, N. T., Ochs, S., and Schultz, T., “Hausa Large Vocabulary Continuous Speech Recognition”, SLTU, 2012
  3. Vu, T., Lyu, D.-C., Weiner, J., Telaar, D., Schlippe, T., Blaicher, F., Chng, E.-S., Schultz, T., and Li, H., “A First Speech Recognition System For Mandarin-English Code-Switch Conversational Speech”, ICASSP, 2012.

Full Paper

Bibliographic reference.  Schlippe, Tim / Ochs, Sebastian / Vu, Ngoc Thang / Schultz, Tanja (2012): "Automatic error recovery for pronunciation dictionaries", In INTERSPEECH-2012, 2298-2301.