8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Analysis of In-car Speech Recognition Experiments using a Large-Scale Multi-Mode Dialogue Corpus

Hiroshi Fujimura, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura

Nagoya University, Japan

This paper analyzes how conversational utterances depend on the mode of dialogue. The analysis uses a speech corpus of 800 speakers collected under three different modes: talking to a human operator, a Wizard of Oz (WOZ) system, and an automatic speech recognition (ASR) system. Characteristics such as sentence complexity, loudness of the voice, and speaking rate are found to differ significantly among the dialogue modes. Linear regression analysis also clarifies the relative importance of these characteristics for speech recognition accuracy.
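The abstract does not say how the regression was set up. One common way to compare the relative importance of predictors measured in different units is to standardize every variable and compare the fitted coefficients. The sketch below illustrates that idea only. The function names are hypothetical, and every number is a made-up toy value, not data or a result from the corpus.

```python
from statistics import mean, pstdev


def standardize(values):
    """Rescale a list of numbers to zero mean and unit variance."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def standardized_coefficients(features, target):
    """Least-squares coefficients after standardizing predictors and target.

    With all variables standardized the intercept is zero, so the normal
    equations (X^T X) beta = X^T y involve only the predictor columns.
    """
    cols = [standardize(c) for c in features]
    y = standardize(target)
    k = len(cols)
    xtx = [[sum(p * q for p, q in zip(cols[i], cols[j])) for j in range(k)]
           for i in range(k)]
    xty = [sum(p * q for p, q in zip(cols[i], y)) for i in range(k)]
    return solve(xtx, xty)


# Toy per-utterance measurements (ASSUMED values, for illustration only).
complexity = [3, 5, 2, 8, 6, 4, 7, 1]                  # e.g. words per sentence
loudness = [60, 62, 58, 65, 61, 63, 59, 64]            # e.g. dB
rate = [5.0, 6.5, 4.5, 7.0, 5.5, 6.0, 7.5, 4.0]        # e.g. units per second

# Toy accuracy generated from an arbitrary linear rule, so the fit is exact.
accuracy = [100 - 2 * c + 0.5 * l - 3 * r
            for c, l, r in zip(complexity, loudness, rate)]

betas = standardized_coefficients([complexity, loudness, rate], accuracy)
for name, beta in zip(["complexity", "loudness", "rate"], betas):
    print(f"{name}: {beta:+.3f}")
```

A larger absolute coefficient means that a one-standard-deviation change in that characteristic goes with a larger change in accuracy, which is one possible reading of "relative importance".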


Bibliographic reference.  Fujimura, Hiroshi / Itou, Katsunobu / Takeda, Kazuya / Itakura, Fumitada (2004): "Analysis of in-car speech recognition experiments using a large-scale multi-mode dialogue corpus", In INTERSPEECH-2004, 2313-2316.