INTERSPEECH 2004 - ICSLP
The multimodal character of speech processing has attracted research endeavors that range from engineers working on automatic speech recognition through to psychologists interested in language processing. This paper serves as a broad introduction to the special Audio-Visual Spoken Language (AVSP) session. The paper focuses on recent developments in the area and touches on theory as well as application. We consider how the scope of the conception of AVSP has broadened; how issues concerning the developmental aspects of AVSP have been tackled; how measures of AV processing have become more sophisticated and how new applications incorporating the visual aspects of speech have been devised. The main aim of this limited review is to highlight the range of the issues and the innovative nature of research programs that fall within this area.
Bibliographic reference. Kim, Jinyoung / Kim, Jeesun / Davis, Chris (2004): "Audio-visual spoken language processing", In INTERSPEECH-2004, 1133-1136.