EUROSPEECH 2003 - INTERSPEECH 2003
This paper presents a study on the relationship between feature post-processing and speaker modelling techniques for robust text-independent speaker recognition. A fully coupled target and background Gaussian mixture speaker model structure is used for hypothesis testing in this speaker model based recognition system. Two formulations of the Maximum a Posteriori (MAP) adaptation algorithm for Gaussian mixture models are considered. We contrast the standard single iteration adaptation algorithm to adaptation using multiple iterations. Three post-processing techniques for cepstral features are considered; feature warping, cepstral mean subtraction (CMS) and RelAtive SpecTrA (RASTA) processing. It is shown that the advantage gained through iterative MAP adaptation is dependent on the parameterisation technique used. Reasons for this dependency are discussed.
Bibliographic reference. Vogt, Robbie / Pelecanos, Jason / Sridharan, Sridha (2003): "Dependence of GMM adaptation on feature post-processing for speaker recognition", In EUROSPEECH-2003, 3013-3016.