Auditory-Visual Speech Processing (AVSP'98)

December 4-6, 1998
Terrigal - Sydney, Australia

Audiovisual Speech Coder: Using Vector Quantization to Exploit the Audio/Video Correlation

Elodie Foucher, Laurent Girin, Gang Feng

Institut de la Communication Parlée, France

Visual information can help listeners better understand what is said. In the speech coding domain, we show that it makes it possible to reduce the transmission rate of a classic vocoder (1.9 kbit/s instead of 2.4 kbit/s) by estimating audio parameters from video ones. In addition, vector quantization appears to be a good method for reducing the redundancy between some audio and visual coefficients. With vector quantization, the bit rate can be reduced further while also decreasing the quantization error.
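
The idea of quantizing correlated audio and visual coefficients jointly can be illustrated with a small sketch. This is not the coder described in the paper: the data are synthetic, the parameter names (an "audio" value and a "visual" value such as lip opening), the linear relation between them, the codebook size of 16, and the plain Lloyd (k-means) training loop are all assumptions chosen for illustration. It only shows how one codebook index can stand for an audio/visual pair, so that the cost per pair is log2(codebook size) bits.

```python
import math
import random


def nearest(codebook, x):
    """Return the index of the codeword closest to x (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - xi) ** 2 for c, xi in zip(codebook[i], x)),
    )


def distortion(codebook, data):
    """Mean squared quantization error of data under the codebook."""
    total = 0.0
    for x in data:
        c = codebook[nearest(codebook, x)]
        total += sum((ci - xi) ** 2 for ci, xi in zip(c, x))
    return total / len(data)


def train_codebook(data, size, iterations=20, seed=0):
    """Train a codebook with Lloyd iterations, starting from random training vectors."""
    rng = random.Random(seed)
    codebook = [list(x) for x in rng.sample(data, size)]
    for _ in range(iterations):
        # Assignment step: put each vector in the cell of its nearest codeword.
        cells = [[] for _ in range(size)]
        for x in data:
            cells[nearest(codebook, x)].append(x)
        # Update step: move each codeword to the centroid of its cell.
        for i, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous codeword
                dim = len(cell[0])
                codebook[i] = [sum(x[d] for x in cell) / len(cell) for d in range(dim)]
    return codebook


def make_joint_data(n=500, seed=1):
    """Synthetic (audio, visual) pairs; the linear relation is an assumption."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        v = rng.uniform(0.0, 1.0)            # hypothetical visual parameter
        a = 0.8 * v + rng.gauss(0.0, 0.05)   # hypothetical correlated audio parameter
        data.append((a, v))
    return data


data = make_joint_data()
codebook = train_codebook(data, size=16)
index = nearest(codebook, data[0])            # the only value that would be transmitted
bits_per_vector = math.log2(len(codebook))    # cost of one joint audio/visual pair
print(index, bits_per_vector, distortion(codebook, data))
```

Because the two synthetic parameters are strongly correlated, the codewords concentrate along the line relating them, which is the kind of redundancy a joint codebook can exploit; how much this helps on real speech parameters is a matter for the paper itself.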


Bibliographic reference. Foucher, Elodie / Girin, Laurent / Feng, Gang (1998): "Audiovisual speech coder: using vector quantization to exploit the audio/video correlation", in AVSP-1998, 67-72.