ISCA Archive Interspeech 2013

The I3a speaker recognition system for NIST SRE12: post-evaluation analysis

Jesús Villalba, Eduardo Lleida, Alfonso Ortega, Antonio Miguel

The I3A submission for the recent NIST 2012 speaker recognition evaluation (SRE) was based on the i-vector approach with a multichannel PLDA classifier. This PLDA is modified so that, for each i-vector, the between-class covariance depends on the type of channel on which the segment was recorded (telephone, interview, clean, noisy, etc.). In this paper, we describe our submission and present a detailed post-evaluation analysis of the results. We analyze several factors affecting performance: enrollment data selection, classifier type, scoring technique, calibration, known and unknown non-targets, whether target speakers are included in development, segment duration, noise level and noise type. Some of these factors are new in this evaluation. After post-evaluation, actual costs improve by 15.43% depending on the common condition.


doi: 10.21437/Interspeech.2013-690

Cite as: Villalba, J., Lleida, E., Ortega, A., Miguel, A. (2013) The I3a speaker recognition system for NIST SRE12: post-evaluation analysis. Proc. Interspeech 2013, 3679-3683, doi: 10.21437/Interspeech.2013-690

@inproceedings{villalba13c_interspeech,
  author={Jesús Villalba and Eduardo Lleida and Alfonso Ortega and Antonio Miguel},
  title={{The I3a speaker recognition system for NIST SRE12: post-evaluation analysis}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={3679--3683},
  doi={10.21437/Interspeech.2013-690}
}