11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Large Margin Gaussian Mixture Models for Speaker Identification

Reda Jourani (1), Khalid Daoudi (2), Régine André-Obrecht (1), Driss Aboutajdine (3)

(1) IRIT, France
(2) INRIA, France
(3) Université Mohammed V - Agdal, Morocco

Gaussian mixture model (GMM) have been widely and successfully used in speaker recognition during the last decade. However, they are generally trained using the generative criterion of maximum likelihood estimation. In this paper, we propose a simple and efficient discriminative approach to learn GMM with a large margin criterion to solve the classification problem. Our approach is based on a recent work about the Large Margin GMM (LM-GMM) where each class is modeled by a mixture of ellipsoids and which has shown good results in speech recognition; we propose a simplification of the original algorithm. We carry out preliminary experiments on a speaker identification task using NIST-SRE'2006 data and we compare the traditional generative GMM approach, the original LM-GMM one and our own version. The results suggest that our algorithm outperforms the two others.

Full Paper

Bibliographic reference.  Jourani, Reda / Daoudi, Khalid / André-Obrecht, Régine / Aboutajdine, Driss (2010): "Large margin Gaussian mixture models for speaker identification", In INTERSPEECH-2010, 1441-1444.