9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Development and Evaluation of Hands-Free Spoken Dialogue System for Railway Station Guidance

Hiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano

NAIST, Japan

In this paper, we describe the development and evaluation of a hands-free spoken dialogue system for railway station guidance. In a railway-station application, noise robustness is the most essential issue for the dialogue system. To address this problem, we introduce two key techniques in the proposed hands-free system: (a) a blind spatial subtraction array (BSSA) as preprocessing, which can efficiently reduce nonstationary and diffuse noise in real time, and (b) robust voice activity detection (VAD) based on speech decoding, which further improves speech recognition accuracy. The experimental assessment of the proposed dialogue system reveals that the combination of real-time BSSA and robust VAD can provide a recognition accuracy of more than 80% under adverse railway-station noise conditions.

Bibliographic reference. Saruwatari, Hiroshi / Takahashi, Yu / Sakai, Hiroyuki / Takeuchi, Shota / Cincarek, Tobias / Kawanami, Hiromichi / Shikano, Kiyohiro (2008): "Development and evaluation of hands-free spoken dialogue system for railway station guidance", in INTERSPEECH-2008, 455-458.