ISCA Archive ICSLP 2000

Continuous listening for unconstrained spoken dialog

Tim Paek, Eric Horvitz, Eric Ringger

A major hindrance to rendering spoken dialog systems capable of ongoing, continuous listening without requiring a push-to-talk device is the problem of distinguishing speech which is intended for the system from that which is overheard. We present a decision-theoretic approach to this problem that exploits Bayesian models of spoken dialog at four levels of analysis within a domain-independent, multi-modal computational architecture called Quartet. We applied Quartet to the task of navigating PowerPoint slide shows during a spoken presentation in a prototype system called Presenter. We describe the runtime behavior of Presenter as well as the results of an experimental study comparing the performance of Presenter to human subjects in discriminating arbitrarily formed spoken requests for slide navigation during a recorded lecture.

doi: 10.21437/ICSLP.2000-34

Cite as: Paek, T., Horvitz, E., Ringger, E. (2000) Continuous listening for unconstrained spoken dialog. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 138-141, doi: 10.21437/ICSLP.2000-34

  author={Tim Paek and Eric Horvitz and Eric Ringger},
  title={{Continuous listening for unconstrained spoken dialog}},
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 138-141},