8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003


Adaptive Decision Fusion for Multi-Sample Speaker Verification Over GSM Networks

Ming-Cheung Cheung (1), Man-Wai Mak (1), Sun-Yuan Kung (2)

(1) Hong Kong Polytechnic University, China
(2) Princeton University, USA

In speaker verification, a claimant may produce two or more utterances. In our previous study [1], we proposed computing optimal weights for fusing the scores of these utterances from their score distribution and from prior knowledge of the score statistics, namely a prior score estimated during enrollment from the mean scores of the corresponding client speaker and some pseudo-impostors. Because the fusion weights depend on the prior scores, this paper proposes adapting the prior scores during verification according to the likelihood that the claimant is an impostor. To this end, a pseudo-impostor GMM score model is created for each speaker. During verification, the claimant's scores are fed to the score model to obtain a likelihood, which is then used to adapt the prior score. Experimental results on GSM-transcoded speech of 150 speakers from the HTIMIT corpus show that the proposed prior score adaptation achieves a relative error reduction of 15% compared with our previous approach, in which the prior scores are non-adaptive.
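The abstract does not give the fusion or adaptation formulas, so the following Python sketch only illustrates the general idea under stated assumptions. It assumes that fusion weights are a softmax over each utterance score's squared deviation from the prior score, and that the prior score is raised in proportion to the average likelihood of the claimant's scores under a one-dimensional pseudo-impostor GMM. The function names, the `spread` and `step` parameters, and the adaptation rule itself are hypothetical and are not taken from the paper.

```python
import math


def fusion_weights(scores, prior, spread=1.0):
    """Softmax weights that grow with a score's squared distance from the prior.

    Assumed weighting form; `spread` is a hypothetical scale parameter.
    """
    exponents = [(s - prior) ** 2 / (2.0 * spread ** 2) for s in scores]
    peak = max(exponents)  # subtract the maximum for numerical stability
    raw = [math.exp(e - peak) for e in exponents]
    total = sum(raw)
    return [r / total for r in raw]


def fuse(scores, prior, spread=1.0):
    """Weighted sum of the utterance scores using the weights above."""
    weights = fusion_weights(scores, prior, spread)
    return sum(w * s for w, s in zip(weights, scores))


def gmm_likelihood(x, components):
    """Density of a 1-D Gaussian mixture; components are (weight, mean, std)."""
    return sum(
        w * math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2.0 * math.pi))
        for w, m, s in components
    )


def adapt_prior(prior, scores, impostor_gmm, step=1.0):
    """Hypothetical adaptation rule: raise the prior score in proportion to the
    average likelihood of the claimant's scores under the impostor score model."""
    avg = sum(gmm_likelihood(s, impostor_gmm) for s in scores) / len(scores)
    return prior + step * avg


# Illustrative values only (not data from the paper).
scores = [-1.0, 0.5, 2.0]
prior = 0.0
impostor_gmm = [(0.6, -1.0, 0.5), (0.4, -2.0, 1.0)]

weights = fusion_weights(scores, prior)
adapted_prior = adapt_prior(prior, scores, impostor_gmm)
fused_before = fuse(scores, prior)
fused_after = fuse(scores, adapted_prior)
```

Under these assumptions, a claimant whose scores resemble impostor scores pushes the prior upward, which shifts weight toward the lower scores and lowers the fused score.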


Bibliographic reference. Cheung, Ming-Cheung / Mak, Man-Wai / Kung, Sun-Yuan (2003): "Adaptive decision fusion for multi-sample speaker verification over GSM networks", in EUROSPEECH-2003, 2969-2972.