ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Within-class covariance normalization for SVM-based speaker recognition

Andrew O. Hatch, Sachin Kajarekar, Andreas Stolcke

This paper extends the within-class covariance normalization (WCCN) technique described in [1, 2] for training generalized linear kernels. We describe a practical procedure for applying WCCN to an SVM-based speaker recognition system where the input feature vectors reside in a high-dimensional space. Our approach involves using principal component analysis (PCA) to split the original feature space into two subspaces: a low-dimensional "PCA space" and a high-dimensional "PCA-complement space." After performing WCCN in the PCA space, we concatenate the resulting feature vectors with a weighted version of their PCA-complements. When applied to a state-of-the-art MLLR-SVM speaker recognition system, this approach achieves improvements of up to 22% in EER and 28% in minimum decision cost function (DCF) over our previous baseline. We also achieve substantial improvements over an MLLR-SVM system that performs WCCN in the PCA space but discards the PCA-complement.


doi: 10.21437/Interspeech.2006-183

Cite as: Hatch, A.O., Kajarekar, S., Stolcke, A. (2006) Within-class covariance normalization for SVM-based speaker recognition. Proc. Interspeech 2006, paper 1874-Wed1A1O.5, doi: 10.21437/Interspeech.2006-183

@inproceedings{hatch06_interspeech,
  author={Andrew O. Hatch and Sachin Kajarekar and Andreas Stolcke},
  title={{Within-class covariance normalization for SVM-based speaker recognition}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1874-Wed1A1O.5},
  doi={10.21437/Interspeech.2006-183}
}