INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition

Chien-Lin Huang (1), Chiori Hori (1), Hideki Kashioka (1), Bin Ma (2)

(1) National Institute of Information and Communications Technology, Kyoto, Japan
(2) Human Language Technology Department, Institute for Infocomm Research, A*STAR, Singapore

This paper presents an approach with ensemble classifiers using unsupervised data selection for speaker recognition. Ensemble learning is a type of machine learning that applies a combination of several weak learners to achieve accurate and improved performance than a single learner. Based on its acoustic characteristics, the speech utterance is divided into several subsets using unsupervised data selection methods. The ensemble classifiers are then trained with these non-overlapping subsets of speech data to improve the recognition accuracy. Our experiments on the 2008 and 2010 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers substantially reduces DCF.

Index Terms: speaker recognition, ensemble classifier, unsupervised data selection

Full Paper

Bibliographic reference.  Huang, Chien-Lin / Hori, Chiori / Kashioka, Hideki / Ma, Bin (2012): "Ensemble classifiers using unsupervised data selection for speaker recognition", In INTERSPEECH-2012, 2666-2669.