Biennial on Digital Signal Processing for In-Vehicle and Mobile Systems

Sesimbra, Portugal
September 2-3, 2005

Towards Robust Spoken Dialog Systems Using Large-Scale In-Car Speech Corpus

Yukiko Yamaguchi (1), Keita Hayashi (1), Takahiro Ono (1), Shingo Kato (1), Yuki Irie (1), Tomohiro Ohno (1), Hiroya Murao (2), Shigeki Matsubara (1), Nobuo Kawaguchi (1), Kazuya Takeda (1)

(1) Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan
(2) SANYO Electric Co., Ltd. Hirakata-shi, Osaka, Japan

We have been studying various topics by using a large-scale corpus, which was built at CIAIR, to construct a robust and practical spoken dialogue system. The CIAIR project has developed a data collection vehicle and collected about 179 hours of multi-modal data in total. We have transcribed the speech data by about 800 subjects, and annotated speech intentions, dependency structures, dialogue structures to the text data. We are continuing various research using the annotated data, such as speech. Intention understanding and speaker’s knowledge acquisition. In this paper, we introduce our research activities, and present the various fruits of the in-car speech corpus.

Bibliographic reference.  Yamaguchi, Yukiko / Hayashi, Keita / Ono, Takahiro / Kato, Shingo / Irie, Yuki / Ohno, Tomohiro / Murao, Hiroya / Matsubara, Shigeki / Kawaguchi, Nobuo / Takeda, Kazuya (2005): "Towards robust spoken dialog systems using large-scale in-car speech corpus", In DSP-in-V-2005, paper A1-4 (abstract).