Speech Prosody 2004

Nara, Japan
March 23-26, 2004

Quantitative Analysis of Prosody in Task-Oriented Dialogs

Hansjörg Mixdorff

Faculty of Computer Science, TFH Berlin University of Applied Sciences, Germany

The current paper reports first results from the analysis of task-oriented dialogs using a Fujisaki model based parameterization of F0 contours. Two versions of map task style dialogs were examined: (1) the recordings made during the map task proper, (2) readings from scripts of the original dialog by the same speakers. In the scope of this paper an analysis of phrase boundaries with respect to form and function is presented. Results indicate, inter alia, that F0 cues differ considerably from what has been observed in earlier studies on read speech. In particular, the strict functional distinction between non-terminal and contact intoneme which has been established through listening experiments cannot be maintained for the map task dialogs. Nevertheless speakers in the dialog make consistent use of F0 cues associated with nonterminal and contact intonemes in read speech. A second issue touched on briefly in this paper is the problem of processing fillers, hesitations and repairs within in the framework of the Fujisaki model based analysis.

Full Paper

Bibliographic reference.  Mixdorff, Hansjörg (2004): "Quantitative analysis of prosody in task-oriented dialogs", In SP-2004, 283-286.