This paper describes the Nuance–Politecnico di Torino (NPT) speaker recognition system submitted to the NIST SRE16 evaluation campaign. Included are the results of post-evaluation tests, focusing on the analysis of the performance of generative and discriminative classifiers, and of score normalization. The submitted system combines the results of four GMM-IVector models, two DNN-IVector models and a GMM-SVM acoustic system. Each system exploits acoustic front-end parameters that differ by feature type and dimension. We analyze the main components of our submission, which contributed to obtaining 8.1% EER and 0.532 actual Cprimary in the challenging SRE16 Fixed condition.
Cite as: Colibro, D., Vair, C., Dalmasso, E., Farrell, K., Karvitsky, G., Cumani, S., Laface, P. (2017) Nuance - Politecnico di Torino’s 2016 NIST Speaker Recognition Evaluation System. Proc. Interspeech 2017, 1338-1342, doi: 10.21437/Interspeech.2017-797
@inproceedings{colibro17_interspeech, author={Daniele Colibro and Claudio Vair and Emanuele Dalmasso and Kevin Farrell and Gennady Karvitsky and Sandro Cumani and Pietro Laface}, title={{Nuance - Politecnico di Torino’s 2016 NIST Speaker Recognition Evaluation System}}, year=2017, booktitle={Proc. Interspeech 2017}, pages={1338--1342}, doi={10.21437/Interspeech.2017-797} }