5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

A Multimedia Platform for Audio-Visual Speech Processing

Ali Adjoudani, Thierry Guiard-Marigny, Bertrand Le Goff, Lionel Reveret, Christian Benoit

Institut de la Communication Parlée, UPRESA CNRS n° 5009, INPG-ENSERG, Université Stendhal, Grenoble, France

In the framework of the European ESPRIT Project MIAMI ("Multimodal Integration for Advanced Multimedia Interfaces"), a platform has been developed at the ICP to study the various combinations of audio-visual speech processing, including real-time lip motion analysis, real-time synthesis of models of the lips and of the face, audio-visual speech recognition of isolated words, and text-to-audio-visual speech synthesis in French. All these facilities are implemented on a network of three SGI computers. Not only is this platform a useful research tool for studying the production and perception of visible speech, as well as audio-visual integration by humans and by machines, but it is also a convenient testbed for studying multimodal man-machine interaction and very low bit rate audio-visual speech communication between humans.

