ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Score distribution scaling for speaker recognition

Vinod Prakash, John H. L. Hansen

In this study, we transform the verification scores of a speaker recognition system in order to standardize the imposter score distribution, this facilitates setting of a speaker-independent threshold at desired False Alarm (FA) rates. Impostor score distributions are estimated using GMMs, and a univariate Gaussianization [1] transform (which is a monotonically increasing mapping) is applied on the scores. It is shown that if a monotonically increasing mapping is used, the Probability of correct detection for a given setting of the FA is maintained as before. Hence, the proposed technique performs distribution scaling without affecting the False Alarm to False Reject relationship of the original test statistic. The maximum (relative) mismatch between the obtained and desired False Alarm rates is less than 10% for a wide range of False Alarm rates. When compared to modeling the imposter score distributions using a single Gaussian (Z-norm case), the overall relative mismatch is reduced by an average of 30%. While the application focus is on speaker recognition, the proposed technique can be used for other binary speech classification tasks as well.

doi: 10.21437/Interspeech.2007-167

Cite as: Prakash, V., Hansen, J.H.L. (2007) Score distribution scaling for speaker recognition. Proc. Interspeech 2007, 2029-2032, doi: 10.21437/Interspeech.2007-167

  author={Vinod Prakash and John H. L. Hansen},
  title={{Score distribution scaling for speaker recognition}},
  booktitle={Proc. Interspeech 2007},