16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

Analysis of the Second Phase of the 2013-2014 i-Vector Machine Learning Challenge

Désiré Bansé (1), George R. Doddington (1), Daniel Garcia-Romero (2), John J. Godfrey (2), Craig S. Greenberg (1), Jaime Hernández-Cordero (3), John M. Howard (1), Alvin F. Martin (1), Lisa P. Mason (3), Alan McCree (2), Douglas A. Reynolds (4)

(2) Johns Hopkins University, USA
(3) DOD, USA
(4) MIT Lincoln Laboratory, USA

In late 2013 and 2014, the National Institute of Standards and Technology (NIST) coordinated an i-vector challenge utilizing data from previous NIST Speaker Recognition Evaluations. Following the evaluation period, a second phase of the challenge was held, where speaker labels were made available for system development. The second phase included system submissions from 23 participants representing 13 different countries, of which 18 also participated in the first phase of the challenge. The top 10 systems participating in both of the challenge phases demonstrated an average relative improvement of approximately 26% between the first and second phases, which represents the value of having access to the speaker labels. The top five participants submitted a system that outperformed the oracle system from the first phase on the evaluation data.

Full Paper

Bibliographic reference.  Bansé, Désiré / Doddington, George R. / Garcia-Romero, Daniel / Godfrey, John J. / Greenberg, Craig S. / Hernández-Cordero, Jaime / Howard, John M. / Martin, Alvin F. / Mason, Lisa P. / McCree, Alan / Reynolds, Douglas A. (2015): "Analysis of the second phase of the 2013-2014 i-vector machine learning challenge", In INTERSPEECH-2015, 3041-3045.