Sixth International Conference on Spoken Language Processing
This paper describes a database consisting of speech and language, which we are currently constructing for the purpose of the research on machine interpretation. The database contains bilingual data of lectures and dialogues. We have collected the speech of about 72 hours in total and transcribed it into the text manually. We have investi- gated the database in order to acquire empirical knowledge of human interpreting. In this paper, we report the charac- teristic features of spoken language by Japanese-to-English interpreters.
Bibliographic reference. Aizawa, Yasuyuki / Matsubara, Shigeki / Kawaguchi, Nobuo / Toyama, Katsuhiko / Inagaki, Yasuyoshi (2000): "Spoken language corpus for machine interpretation research", In ICSLP-2000, vol.3, 398-401.