Sixth International Conference on Spoken Language Processing
The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting speech corpora in moving cars which are made available as resources to advance the research and development of robust ASRs and spoken dialogue systems under high-noise conditions. The speech corpus consists of (1) phonetically balanced sentences, (2) digit strings, (3) discrete words and (4) transcribed spoken dialogues between drivers and information systems for navigation and information retrieval. These data are collected in vehicles under both idling and driving situations. The language of the corpus is currently Japanese. The number of subjects is currently about 300, total recording time is over 200 hours and total corpus size is about 160GByte. We have also been recording video images from three different angles, vehicle-control signals, and vehicle location, all synchronized with the speech recording. We report the objective of the speech corpus, the recording methods and the recording vehicle developed.
Bibliographic reference. Kawaguchi, Nobuo / Matsubara, Shigeki / Iwa, Hiroyuki / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada / Inagaki, Yasuyoshi (2000): "Construction of speech corpus in moving car environment", In ICSLP-2000, vol.3, 362-365.