10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Enabling a User to Specify an Item at Any Time During System Enumeration - Item Identification for Barge-In-Able Conversational Dialogue Systems

Kyoko Matsuyama, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Kyoto University, Japan

In conversational dialogue systems, users prefer to speak at any time and to use natural expressions. We have developed a semi-blind source separation method based on Independent Component Analysis (ICA), which allows users to barge in over system utterances at any time. We propose a novel method that uses timing information derived from barge-in utterances to identify the one item a user indicates during system enumeration. First, we determine the timing distribution of user utterances containing referential expressions and approximate it with a gamma distribution. Second, we represent both the utterance timing and the automatic speech recognition (ASR) results as probabilities of the desired selection from the system's enumeration. We then integrate these two probabilities and identify the item with the maximum likelihood of selection. In experiments with 400 utterances, our method achieved higher identification accuracy than two baseline methods, one using ASR results only and one using utterance timing only.
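
The two-step procedure described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the timing likelihood of each item is a gamma density over the delay between that item's start time and the user's barge-in, and that the timing and ASR probabilities are combined by a simple product, which treats them as independent. The function names, the shape and scale values, and the example times are all hypothetical; the abstract does not give the fitted parameters or the exact integration formula.

```python
import math


def gamma_pdf(x, shape, scale):
    """Gamma density at x; zero for non-positive x."""
    if x <= 0:
        return 0.0
    norm = math.gamma(shape) * scale ** shape
    return x ** (shape - 1) * math.exp(-x / scale) / norm


def timing_probs(barge_in_time, item_start_times, shape=2.0, scale=1.0):
    """Probability of each item given only the barge-in time.

    The delay is measured from each item's start time (an assumption).
    shape and scale are placeholder values, not fitted parameters.
    """
    scores = [gamma_pdf(barge_in_time - start, shape, scale)
              for start in item_start_times]
    total = sum(scores)
    if total == 0.0:
        # No item has started yet: fall back to a uniform distribution.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def identify_item(barge_in_time, item_start_times, asr_probs,
                  shape=2.0, scale=1.0):
    """Index of the item that maximizes timing probability times ASR probability."""
    t_probs = timing_probs(barge_in_time, item_start_times, shape, scale)
    joint = [t * a for t, a in zip(t_probs, asr_probs)]
    return max(range(len(joint)), key=joint.__getitem__)


# Hypothetical enumeration: three items starting at 0 s, 3 s and 6 s,
# with the user barging in at 4 s.
starts = [0.0, 3.0, 6.0]
uninformative_asr = [1 / 3, 1 / 3, 1 / 3]
confident_asr = [0.9, 0.1, 0.0]

timing_only_choice = identify_item(4.0, starts, uninformative_asr)
combined_choice = identify_item(4.0, starts, confident_asr)
```

With uninformative ASR probabilities the choice is driven by timing alone, so the item enumerated shortly before the barge-in is selected. When the ASR result strongly favors another item, the product can shift the decision, which is the behavior the integration step is meant to provide.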


Bibliographic reference.  Matsuyama, Kyoko / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G. (2009): "Enabling a user to specify an item at any time during system enumeration - item identification for barge-in-able conversational dialogue systems", In INTERSPEECH-2009, 252-255.