Sixth International Conference on Spoken Language Processing (ICSLP 2000)

Beijing, China
October 16-20, 2000

Continuous Listening for Unconstrained Spoken Dialog

Tim Paek, Eric Horvitz, Eric Ringger

Microsoft Research, Redmond, WA, USA

A major hindrance to rendering spoken dialog systems capable of ongoing, continuous listening without requiring a push-to-talk device is the problem of distinguishing speech that is intended for the system from speech that is merely overheard. We present a decision-theoretic approach to this problem that exploits Bayesian models of spoken dialog at four levels of analysis within a domain-independent, multi-modal computational architecture called Quartet. We applied Quartet to the task of navigating PowerPoint slide shows during a spoken presentation in a prototype system called Presenter. We describe the runtime behavior of Presenter as well as the results of an experimental study comparing the performance of Presenter with that of human subjects in discriminating arbitrarily formed spoken requests for slide navigation during a recorded lecture.

Bibliographic reference. Paek, Tim / Horvitz, Eric / Ringger, Eric (2000): "Continuous listening for unconstrained spoken dialog", in ICSLP-2000, vol. 1, 138-141.