Multi-Modal Dialogue in Mobile Environments (IDS-02)

June 17-19, 2002
Kloster Irsee, Germany

Belief Network based Disambiguation of Object Reference in Spoken Dialogue System for Robot

Yoko Yamakata, Tatsuya Kawahara and Hiroshi G. Okuno

School of Informatics, Kyoto University, Japan

We are studying joint activity in which a remote robot finds an object by commnnicating with the user over a voice-only channel. We focus on how the robot disambiguates the reference of the uttered word or phrase to the target object. For example, by "cup", one may refer to a "teacup", a "coffee cup", or even a "glass" under some situations. This reference (hereafter, "object reference") is user-dependent. We confirm that a user model of object references is significant by conducting a survey of 12 subjects. In addition to ambiguity of object reference, actual systems should cope with two other sources of uncertainty in speech and image recognition. We present a Belief Network based probabilistic reasoning system to determine the object reference. The resulting system demonstrates that the number of interactions needed to find a common reference is reduced as the user model is refined.


Full Paper

Bibliographic reference.  Yamakata, Yoko / Kawahara, Tatsuya / Okuno, Hiroshi G. (2002): "Belief network based disambiguation of object reference in spoken dialogue system for robot", In IDS-2002, paper 46; 11 pp.