In a previous study we demonstrated that subjects could use prosodic features (primarily peak height and alignment) to make different interpretations of synthesized fragmentary grounding utterances. In the present study we test the hypothesis that subjects also change their behavior accordingly in a human-computer dialog setting. We report on an experiment in which subjects participate in a color-naming task in a Wizard-of-Oz controlled human-computer dialog in Swedish. The results show that two annotators were able to categorize the subjects responses based on pragmatic meaning. Moreover, the subjects response times differed significantly, depending on the prosodic features of the grounding fragment spoken by the system.
Cite as: Skantze, G., House, D., Edlund, J. (2006) User responses to prosodic variation in fragmentary grounding utterances in dialog. Proc. Interspeech 2006, paper 1229-Wed3WeS.1, doi: 10.21437/Interspeech.2006-548
@inproceedings{skantze06_interspeech, author={Gabriel Skantze and David House and Jens Edlund}, title={{User responses to prosodic variation in fragmentary grounding utterances in dialog}}, year=2006, booktitle={Proc. Interspeech 2006}, pages={paper 1229-Wed3WeS.1}, doi={10.21437/Interspeech.2006-548} }