ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition

Vinod Prakash, John H. L. Hansen

In this study, the problem of identifying in-set versus out-of-set speakers is addressed. Here the emphasis is on low enrolment and test data durations, in a text-independent mode. In order to compensate for the limited enrolment data (5 sec), a method is proposed that utilizes data from speakers that are acoustically close to a particular in-set speaker. A speaker specific model is obtained by adaptation of a base model that is built using data from such speakers. The performance of the proposed algorithm is evaluated using the TIMIT database with an adapted GMM classifier (GMM-UBM) employed as the baseline system. Experimental results show a consistent increase in system performance, with a relative improvement ranging from 10.57-58.33% depending on inset speaker size and test data duration.


doi: 10.21437/Interspeech.2006-174

Cite as: Prakash, V., Hansen, J.H.L. (2006) A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition. Proc. Interspeech 2006, paper 1847-Tue1CaP.8, doi: 10.21437/Interspeech.2006-174

@inproceedings{prakash06_interspeech,
  author={Vinod Prakash and John H. L. Hansen},
  title={{A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1847-Tue1CaP.8},
  doi={10.21437/Interspeech.2006-174}
}