ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Automatic transcription system for meetings of the Japanese national congress

Yuya Akita, Masato Mimura, Tatsuya Kawahara

This paper presents an automatic speech recognition (ASR) system for assisting meeting record creation of the National Congress of Japan. The system is designed to cope with spontaneous characteristics of meeting speech, as well as a variety of topics and speakers. For acoustic model, minimum phone error (MPE) training is applied with several normalization techniques. For language model, we have proposed statistical style transformation to generate spoken-style N-grams and their statistics. We also introduce statistical modeling of pronunciation variation in spontaneous speech. The ASR system was evaluated on real congressional meetings, and achieved word accuracy of 84%. It is also suggested that the ASR-based transcripts with this accuracy level is usable for editing meeting records.


doi: 10.21437/Interspeech.2009-19

Cite as: Akita, Y., Mimura, M., Kawahara, T. (2009) Automatic transcription system for meetings of the Japanese national congress. Proc. Interspeech 2009, 84-87, doi: 10.21437/Interspeech.2009-19

@inproceedings{akita09_interspeech,
  author={Yuya Akita and Masato Mimura and Tatsuya Kawahara},
  title={{Automatic transcription system for meetings of the Japanese national congress}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={84--87},
  doi={10.21437/Interspeech.2009-19}
}