ISCA Archive Eurospeech 2001

Information fusion for robust speaker verification

Conrad Sanderson, Kuldip K. Paliwal

This paper studies two information fusion approaches, feature vector concatenation and decision fusion, for reducing error rates in a speaker verification system operating in mismatched conditions. Three types of features are fused: Mel Frequency Cepstral Coefficients (MFCC), MFCC with Cepstral Mean Subtraction (CMS), and Maximum Auto-Correlation Values (MACV). In the adaptive decision fusion approach, the mismatch sensitivity of Linear Prediction Cepstral Coefficients (LPCC) serves as a speech quality measure for selecting the weight given to the MFCC modality. We show that in most cases concatenation fusion is superior to decision fusion. These results lead us to propose a hybrid fusion approach in which two concatenation-fused feature combinations are further fused using adaptive decision fusion. The hybrid system is shown to have the lowest error rates on both clean and noisy speech.
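The two fusion styles named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not give the combination rule or how the LPCC-based quality measure maps to a weight, so the weighted sum and the clipped linear mapping below are assumptions, and the function names `concatenate_features`, `fuse_decisions`, and `quality_to_weight` are hypothetical.

```python
from typing import List, Sequence


def concatenate_features(*streams: Sequence[Sequence[float]]) -> List[List[float]]:
    """Concatenation fusion: join per-frame vectors from several feature streams.

    Each stream is a sequence of frames, and all streams must have the
    same number of frames (e.g. MFCC and MACV computed on the same frames).
    """
    n_frames = len(streams[0])
    if any(len(s) != n_frames for s in streams):
        raise ValueError("all streams must have the same number of frames")
    return [[v for s in streams for v in s[t]] for t in range(n_frames)]


def quality_to_weight(quality: float, low: float, high: float) -> float:
    """ASSUMPTION: map a speech quality measure to a weight in [0, 1].

    A clipped linear mapping is used purely for illustration; the paper's
    actual mapping is not stated in the abstract.
    """
    if high <= low:
        raise ValueError("high must be greater than low")
    w = (quality - low) / (high - low)
    return min(1.0, max(0.0, w))


def fuse_decisions(score_a: float, score_b: float, weight_a: float) -> float:
    """Decision fusion: combine two expert scores into one.

    ASSUMPTION: a weighted sum, with weight_a applied to the first expert
    and (1 - weight_a) to the second.
    """
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")
    return weight_a * score_a + (1.0 - weight_a) * score_b


if __name__ == "__main__":
    # Two frames of a 2-dimensional stream and a 1-dimensional stream.
    fused = concatenate_features([[1.0, 2.0], [3.0, 4.0]], [[5.0], [6.0]])
    print(fused)

    # Hypothetical scores from two experts, with a quality-derived weight.
    w = quality_to_weight(quality=0.5, low=0.0, high=1.0)
    print(fuse_decisions(2.0, 4.0, w))
```

In this sketch, a hybrid arrangement in the spirit of the abstract would score two different concatenated feature sets with separate classifiers and then pass the two scores to `fuse_decisions`.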


doi: 10.21437/Eurospeech.2001-238

Cite as: Sanderson, C., Paliwal, K.K. (2001) Information fusion for robust speaker verification. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 755-758, doi: 10.21437/Eurospeech.2001-238

@inproceedings{sanderson01_eurospeech,
  author={Conrad Sanderson and Kuldip K. Paliwal},
  title={{Information fusion for robust speaker verification}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={755--758},
  doi={10.21437/Eurospeech.2001-238}
}