INTERSPEECH 2004 - ICSLP
In a review paper about audio-visual (AV) fusion models in speech perception, we (Schwartz et al., 1998) proposed a taxonomy of models around two basic questions: architecture and control. Six years after, it appears that the proposals we made still seem rather convenient for discussing major questions about AV fusion. Moreover -- and more importantly -- recent experimental and theoretical progress seem to provide some elements of answer in both aspects. The aim of this paper is to review these elements, and to incorporate them into the general architecture-and-control framework.
Bibliographic reference. Schwartz, Jean-Luc / Cathiard, Marie (2004): "Modeling audio-visual speech perception: back on fusion architectures and fusion control", In INTERSPEECH-2004, 2017-2020.