Auditory-Visual Speech Processing 2005

British Columbia, Canada
July 24-27, 2005

Audio-Visual Speaker Identification Using the CUAVE Database

David Dean, Patrick Lucey, Sridha Sridharan

Speech, Audio, Image and Video Research Laboratory, Queensland University of Technology, Brisbane, Australia

Because the CUAVE database is freely available, it provides a valuable platform for forming benchmarks and comparing research. This paper shows that the CUAVE database can be used successfully to test speaker identification systems, with performance comparable to that of existing systems implemented on other databases. Additionally, this research shows that the optimal decision-fusion configuration for an audio-visual speaker identification system relies heavily on the video modality in all but clean speech conditions.

Bibliographic reference. Dean, David / Lucey, Patrick / Sridharan, Sridha (2005): "Audio-visual speaker identification using the CUAVE database", In AVSP-2005, 97-102.