NIST Speaker Recognition Evaluation 2016 has revealed the importance of score normalization for mismatched data conditions. This paper analyzes several score normalization techniques for test conditions with multiple languages. The best performing one for a PLDA classifier is an adaptive s-norm with 30% relative improvement over the system without any score normalization. The analysis shows that the adaptive score normalization (using top scoring files per trial) selects cohorts that in 68% contain recordings from the same language and in 92% of the same gender as the enrollment and test recordings. Our results suggest that the data to select score normalization cohorts should be a pool of several languages and channels and if possible, its subset should contain data from the target domain.
Cite as: Matějka, P., Novotný, O., Plchot, O., Burget, L., Sánchez, M.D., Černocký, J. (2017) Analysis of Score Normalization in Multilingual Speaker Recognition. Proc. Interspeech 2017, 1567-1571, doi: 10.21437/Interspeech.2017-803
@inproceedings{matejka17_interspeech, author={Pavel Matějka and Ondřej Novotný and Oldřich Plchot and Lukáš Burget and Mireia Diez Sánchez and Jan Černocký}, title={{Analysis of Score Normalization in Multilingual Speaker Recognition}}, year=2017, booktitle={Proc. Interspeech 2017}, pages={1567--1571}, doi={10.21437/Interspeech.2017-803} }