EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

On the Fusion of Dissimilarity-Based Classifiers for Speaker Identification

Tomi Kinnunen, Ville Hautamaki, Pasi Franti

University of Joensuu, Finland

In this work, we describe a speaker identification system that uses multiple supplementary information sources to compute a combined match score for the unknown speaker. Each speaker profile in the database consists of multiple feature vector sets that can vary in scale, dimensionality, and number of vectors. The evidence from each feature set is weighted by its reliability, which is set a priori. The confidence of the identification result is also estimated. The system is evaluated on a corpus of 110 Finnish speakers. The evaluated feature sets include mel-cepstrum, LPC-cepstrum, dynamic cepstrum, the long-term averaged spectrum of the vowel /A/, and F0.
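The abstract does not give the fusion rule, so the following is only a minimal sketch of how a weighted combination of dissimilarity scores might look, not the authors' implementation. Everything in it is an assumption made for illustration: the set-to-set dissimilarity (average nearest-neighbour Euclidean distance), the per-feature normalisation used to cope with differing scales, the weighted-sum fusion, the margin-based confidence, and all names, weights, and toy data.

```python
from math import dist


def set_dissimilarity(test_vectors, model_vectors):
    """Average distance from each test vector to its nearest model vector (assumed measure)."""
    nearest = [min(dist(x, m) for m in model_vectors) for x in test_vectors]
    return sum(nearest) / len(nearest)


def fuse_scores(distances_per_feature, weights):
    """Weighted sum of per-feature distances, each normalised to sum to 1 over speakers."""
    total_weight = sum(weights.values())
    fused = {}
    for feature, distances in distances_per_feature.items():
        norm = sum(distances.values()) or 1.0  # guard against an all-zero feature
        for speaker, d in distances.items():
            share = weights[feature] / total_weight
            fused[speaker] = fused.get(speaker, 0.0) + share * d / norm
    return fused


def identify(fused):
    """Return the closest speaker and a margin-based confidence in [0, 1] (assumed measure)."""
    ranked = sorted(fused, key=fused.get)
    best, runner_up = ranked[0], ranked[1]
    if fused[runner_up] == 0:
        return best, 0.0
    return best, 1.0 - fused[best] / fused[runner_up]


# Hypothetical toy data: feature sets differ in dimensionality and vector count.
profiles = {
    "spk_a": {"cepstrum": [(0.0, 0.0), (1.0, 0.0)], "f0": [(110.0,)]},
    "spk_b": {"cepstrum": [(5.0, 5.0), (6.0, 5.0)], "f0": [(200.0,)]},
}
unknown = {
    "cepstrum": [(0.0, 1.0), (1.0, 1.0), (0.5, 0.0)],
    "f0": [(120.0,), (115.0,)],
}
weights = {"cepstrum": 0.7, "f0": 0.3}  # a priori reliabilities, chosen arbitrarily here

distances_per_feature = {
    feature: {
        speaker: set_dissimilarity(unknown[feature], sets[feature])
        for speaker, sets in profiles.items()
    }
    for feature in weights
}
fused = fuse_scores(distances_per_feature, weights)
best, confidence = identify(fused)
print(best, round(confidence, 2))
```

Normalising each feature set's distances before weighting is one simple way to keep a large-scale feature such as F0 (in Hz) from dominating a small-scale one; other normalisations would serve equally well in a sketch like this.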


Bibliographic reference. Kinnunen, Tomi / Hautamaki, Ville / Franti, Pasi (2003): "On the fusion of dissimilarity-based classifiers for speaker identification", In EUROSPEECH-2003, 2641-2644.