14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Regularized Subspace n-Gram Model for Phonotactic iVector Extraction

Mehdi Soufifar, Lukáš Burget, Oldřich Plchot, Sandro Cumani, Jan Černocký

Brno University of Technology, Czech Republic

Phonotactic language identification (LID) by means of n-gram statistics and discriminative classifiers is a popular approach for the LID problem. Low-dimensional representation of the n-gram statistics leads to the use of more diverse and efficient machine learning techniques in the LID. Recently, we proposed phototactic iVector as a low-dimensional representation of the n-gram statistics. In this work, an enhanced modeling of the n-gram probabilities along with regularized parameter estimation is proposed. The proposed model consistently improves the LID system performance over all conditions up to 15% relative to the previous state of the art system. The new model also alleviates memory requirement of the iVector extraction and helps to speed up subspace training. Results are presented in terms of Cavg over NIST LRE2009 evaluation set.

Full Paper

Bibliographic reference.  Soufifar, Mehdi / Burget, Lukáš / Plchot, Oldřich / Cumani, Sandro / Černocký, Jan (2013): "Regularized subspace n-gram model for phonotactic ivector extraction", In INTERSPEECH-2013, 74-78.