INTERSPEECH 2010
11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Rapid Bootstrapping of Five Eastern European Languages Using the Rapid Language Adaptation Toolkit

Ngoc Thang Vu, Tim Schlippe, Franziska Kraus, Tanja Schultz

Cognitive Systems Lab, Karlsruhe Institute of Technology, Germany

This paper presents our latest efforts toward large vocabulary speech recognition systems for five Eastern European languages such as Russian, Bulgarian, Czech, Croatian and Polish using the Rapid Language Adaptation Toolkit (RLAT) [1]. We investigated the possibility of crawling large quantities of text material from the Internet, which is very cheap but also requires text post-processing steps due to the varying text quality. The goal of this study is to determine the best strategy for language model optimization on the given domain in a short time period with minimal human effort. Our results show that we can build an initial ASR system for these five languages in only ten days using RLAT. On the multilingual GlobalPhone speech corpus [2] we achieved a Word Error Rate (WER) of 16.9% for Bulgarian, 23.5% for Czech, 20.4% for Polish, 32.8% for Croatian and 36.2% for Russian.

References

  1. [1] T. Schultz and A. Black. Rapid Language Adaptation Tools and Technologies for Multilingual Speech Processing. In: Proc. ICASSP Las Vegas, NV 2008. [2] T. Schultz. GlobalPhone: A Multilingual Speech and Text Database developed at Karlsruhe University. In: Proc. ICSLP Denver, CO, 2002.

Full Paper

Bibliographic reference.  Vu, Ngoc Thang / Schlippe, Tim / Kraus, Franziska / Schultz, Tanja (2010): "Rapid bootstrapping of five eastern european languages using the rapid language adaptation toolkit", In INTERSPEECH-2010, 865-868.