A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a governmental support. The repository consists of a recognition engine (Julius), Japanese acoustic models and statistical language models as well as Japanese morphological analysis tools. These modules can be easily integrated and replaced under a plug-and-play framework, which makes it possible to fairly evaluate components and to develop specific application systems. Assessment of these modules and systems in a 20000-word dictation task is reported. The software repository is freely available to the public.
Cite as: Kawahara, T., Lee, A., Kobayashi, T., Takeda, K., Minematsu, N., Sagayama, S., Itou, K., Ito, A., Yamamoto, M., Yamada, A., Utsuro, T., Shikano, K. (2000) Free software toolkit for Japanese large vocabulary continuous speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 476-479, doi: 10.21437/ICSLP.2000-852
@inproceedings{kawahara00_icslp, author={Tatsuya Kawahara and Akinobu Lee and Tetsunori Kobayashi and Kazuya Takeda and Nobuaki Minematsu and Shigeki Sagayama and Katsunobu Itou and Akinori Ito and Mikio Yamamoto and Atsushi Yamada and Takehito Utsuro and Kiyohiro Shikano}, title={{Free software toolkit for Japanese large vocabulary continuous speech recognition}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 4, 476-479}, doi={10.21437/ICSLP.2000-852} }