Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Construction of Speech Corpus in Moving Car Environment

Nobuo Kawaguchi (1,2), Shigeki Matsubara (1,3), Hiroyuki Iwa (1,4), Shoji Kajita (1,5), Kazuya Takeda (1,2), Fumitada Itakura (1,5), Yasuyoshi Inagaki (1,2)

(1) Center for Integrated Acoustic Information Research (CIAIR), Nagoya University
(2) Graduate School of Engineering, Nagoya University
(3) Faculty of Language and Culture, Nagoya University
(4) Kojima Press Industry Co. Ltd.; (5) Center for Information Media Studies, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan

The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting speech corpora in moving cars which are made available as resources to advance the research and development of robust ASRs and spoken dialogue systems under high-noise conditions. The speech corpus consists of (1) phonetically balanced sentences, (2) digit strings, (3) discrete words and (4) transcribed spoken dialogues between drivers and information systems for navigation and information retrieval. These data are collected in vehicles under both idling and driving situations. The language of the corpus is currently Japanese. The number of subjects is currently about 300, total recording time is over 200 hours and total corpus size is about 160GByte. We have also been recording video images from three different angles, vehicle-control signals, and vehicle location, all synchronized with the speech recording. We report the objective of the speech corpus, the recording methods and the recording vehicle developed.

Full Paper

Bibliographic reference.  Kawaguchi, Nobuo / Matsubara, Shigeki / Iwa, Hiroyuki / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada / Inagaki, Yasuyoshi (2000): "Construction of speech corpus in moving car environment", In ICSLP-2000, vol.3, 362-365.