Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition

Ondřej Novotný, Oldřich Plchot, Ondřej Glembek, Lukáš Burget


In this work, we continue our research on the i-vector extractor for speaker verification (SV) and optimize its architecture for fast and effective discriminative training. We were motivated by the computational and memory requirements caused by the large number of parameters of the original generative i-vector model. Our aim is to preserve the power of the original generative model while focusing the model on the extraction of speaker-related information. We show that a standard generative i-vector extractor can be represented by a model with significantly fewer parameters that obtains similar performance on SV tasks. We can further refine this compact model by discriminative training and obtain i-vectors that lead to better performance on various SV benchmarks representing different acoustic domains.


DOI: 10.21437/Interspeech.2019-1757

Cite as: Novotný, O., Plchot, O., Glembek, O., Burget, L. (2019) Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. Proc. Interspeech 2019, 4330-4334, DOI: 10.21437/Interspeech.2019-1757.


@inproceedings{Novotný2019,
  author={Ondřej Novotný and Oldřich Plchot and Ondřej Glembek and Lukáš Burget},
  title={{Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={4330--4334},
  doi={10.21437/Interspeech.2019-1757},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1757}
}