INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Modeling Audio-Visual Speech Perception: Back on Fusion Architectures and Fusion Control

Jean-Luc Schwartz, Marie Cathiard

Université Stendhal, France

In a review paper about audio-visual (AV) fusion models in speech perception, we (Schwartz et al., 1998) proposed a taxonomy of models around two basic questions: architecture and control. Six years after, it appears that the proposals we made still seem rather convenient for discussing major questions about AV fusion. Moreover -- and more importantly -- recent experimental and theoretical progress seem to provide some elements of answer in both aspects. The aim of this paper is to review these elements, and to incorporate them into the general architecture-and-control framework.

Full Paper

Bibliographic reference.  Schwartz, Jean-Luc / Cathiard, Marie (2004): "Modeling audio-visual speech perception: back on fusion architectures and fusion control", In INTERSPEECH-2004, 2017-2020.