2001: A Speaker Odyssey - The Speaker Recognition Workshop
June 18-22, 2001
The hypothesis that for a given amount of training data a speaker model has an optimum number of components is examined. This is investigated with regard to Gaussian mixture models with and without world model adaptation. Results show that maximising the number of components in a speaker model can improve speaker recognition results. Comparisons with vector quantisation indicate that sensible use of out-of-class data is essential for optimising a recognition system.
Bibliographic reference. Stapert, Robert / Mason, John S. (2001): "Speaker recognition and the acoustic speech space", In ODYSSEY-2001, 195-199.