Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

Text-Independent Speaker Verification Using Virtual Speaker Based Cohort Normalization

Toshihiro Isobe, Jun-ichi Takahashi

Laboratory for Information Technology, NTT Data Corporation, Tokyo, Japan

In this paper, we propose a new score normalization method for text-independent speaker verification using GMM (Gaussian Mixture Model). In the proposed method, cohort model is designed as virtual speaker model based on the similarity of local acoustic information between the reference speaker and other customers. The similarity is determined using statistical distance between model components such as the Gaussian distributions. Therefore, synthesized cohort model is statistically close to the reference speaker model, and can provide an effective normalizing score for various observed measurements. The experimental results using telephone speech of 60 speakers showed that the proposed method is superior to the typical methods with cohort speaker model or pooled model. Equal Error Rate (EER) when using common posteriori-defined threshold value for every speakers was drastically reduced from 3.82 % (for the conventional normalization with cohort speaker model) or 10.3 % (for normalization with pooled model) to 2.50 % (for the proposed method) when cohort size is equal to three.


Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Isobe, Toshihiro / Takahashi, Jun-ichi (1999): "Text-independent speaker verification using virtual speaker based cohort normalization", In EUROSPEECH'99, 987-990.