8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Prosody Change and Response Timing Analysis in Spontaneously Spoken Dialogs and Their Modeling in a Spoken Dialog System

Ryota Nishimura (1), Norihide Kitaoka (2), Seiichi Nakagawa (1)

(1) Toyohashi University of Technology, Japan
(2) Nagoya University, Japan

If a dialog system were to respond to a user as naturally as a human, interaction would be smoother. Imitating the human prosodic behavior of utterances is important in computer-human natural conversations. In this paper, to develop a cooperative/ friendly spoken dialog system, we analyzed the correlations between F0 synchrony tendency or overlap frequency and subjective measures: "liveliness," "familiarity," and "informality" in human-human dialogs. We also modeled the properties of these features and implemented the model on our dialog system that generated the response timing of aizuchi (back-channel), turn-taking based on a decision tree in real time, and dynamical F0 changes to realize chat-like conversations.

Full Paper

Bibliographic reference.  Nishimura, Ryota / Kitaoka, Norihide / Nakagawa, Seiichi (2007): "Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system", In INTERSPEECH-2007, 2565-2568.