Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition

Tatsuya Kawahara (1), Akinobu Lee (1), Tetsunori Kobayashi (2), Kazuya Takeda (3), Nobuaki Minematsu (4), Shigeki Sagayama (4), Katsunobu Itou (5), Akinori Ito (6), Mikio Yamamoto (7), Atsushi Yamada (8), Takehito Utsuro (9), Kiyohiro Shikano (10)

(1) School of Informatics, Kyoto University; (2) Waseda University, Tokyo; (3) Nagoya University; (4) Tokyo University; (5) ETL; (6) Yamagata University; (7) Yamagata University; (8) ASTEM; (9) Toyohashi University of Technology; (10) Nara Institute of Science and Technology; Japan

A shareable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and was developed by researchers at several academic institutes with government support. The repository consists of a recognition engine (Julius), Japanese acoustic models, and statistical language models, as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to evaluate components fairly and to develop systems for specific applications. An assessment of these modules and systems on a 20,000-word dictation task is reported. The software repository is freely available to the public.

Bibliographic reference.  Kawahara, Tatsuya / Lee, Akinobu / Kobayashi, Tetsunori / Takeda, Kazuya / Minematsu, Nobuaki / Sagayama, Shigeki / Itou, Katsunobu / Ito, Akinori / Yamamoto, Mikio / Yamada, Atsushi / Utsuro, Takehito / Shikano, Kiyohiro (2000): "Free software toolkit for Japanese large vocabulary continuous speech recognition", In ICSLP-2000, vol.4, 476-479.