This paper describes a multi-modal person verification system using speech and frontal face images. We consider two different speaker verification algorithms, a text-independent method using a second-order statistical measure and a text-dependent method based on hidden Markov modelling, as well as a face verification technique using a robust form of corellation. Fusion of the different recognition modules is performed by a Support Vector Machine classifier. Experimental results obtained on the audio-visual database XM2VTS for individual modalities and their combinations show that multimodal systems yield better performances than individual modules for all cases.
Cite as: Luettin, J., Ben-Yacoub, S. (1999) Robust person verification based on speech and facial images. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 991-994, doi: 10.21437/Eurospeech.1999-242
@inproceedings{luettin99_eurospeech, author={J. Luettin and S. Ben-Yacoub}, title={{Robust person verification based on speech and facial images}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={991--994}, doi={10.21437/Eurospeech.1999-242} }