Auditory-Visual Speech Processing (AVSP'99)

August 7-10, 1999
Santa Cruz, CA, USA

A Diffusion Network Approach to Visual Speech Recognition

Javier R. Movellan, Paul Mineiro

University of California at San Diego, La Jolla, CA, USA

In this paper we present an alternative to hidden Markov models for the recognition of image sequences. The approach is based on a stochastic version of recurrent neural networks, which we call diffusion networks. Contrary to hidden Markov models, diffusion networks operate with continuous state dynamics, and generate continuous paths. This aspect that may be beneficial in computer vision tasks in which continuity is a useful constraint. In this paper we review results required for the implementation of diffusion networks, and then apply them to a visual speech recognition task. Diffusion networks outperformed the results obtained with the best hidden Markov models.

Full Paper

Bibliographic reference.  Movellan, Javier R. / Mineiro, Paul (1999): "A diffusion network approach to visual speech recognition", In AVSP-1999, paper #15.