ISCA Archive Odyssey 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Tomi Kinnunen, Nicholas Evans

Deep learning has brought impressive progress in the study of both automatic speaker verification (ASV) and spoofing countermeasures (CM). Although solutions are mutually dependent, they have typically evolved as standalone sub-systems whereby CM solutions are usually designed for a fixed ASV system. The work reported in this paper aims to gauge the improvements in reliability that can be gained from their closer integration. Results derived using the popular ASVspoof2019 dataset indicate that the equal error rate (EER) of a state-of-the-art ASV system degrades from 1.63% to 23.83% when the evaluation protocol is extended with spoofed trials. However, even the straightforward integration of ASV and CM systems in the form of score-sum and deep neural network-based fusion strategies reduces the EER to 1.71% and 6.37%, respectively. The new Spoofing-Aware Speaker Verification (SASV) challenge has been formed to encourage greater attention to the integration of ASV and CM systems as well as to provide a means to benchmark different solutions.
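
To illustrate the score-sum idea mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes that the ASV and CM scores are already on comparable scales, that fusion is a plain per-trial sum, and that the EER is found by a simple threshold sweep. All function names and the toy scores are hypothetical and chosen only for illustration; they are not results from the paper.

```python
import numpy as np


def score_sum_fusion(asv_scores, cm_scores):
    """Fuse per-trial ASV and CM scores by simple addition (assumed form)."""
    return np.asarray(asv_scores, dtype=float) + np.asarray(cm_scores, dtype=float)


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping every observed score as a threshold."""
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)
    thresholds = np.sort(np.concatenate([target, nontarget]))
    # False rejection: target trials scoring below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance: non-target trials scoring at or above the threshold.
    far = np.array([(nontarget >= t).mean() for t in thresholds])
    # Take the threshold where the two error rates are closest.
    idx = np.argmin(np.abs(frr - far))
    return (frr[idx] + far[idx]) / 2.0


# Hypothetical toy scores (higher = more likely a bona fide target speaker).
asv_target = [0.80, 0.70, 0.90]      # target speaker, bona fide speech
cm_target = [0.90, 0.95, 0.85]
asv_zero_effort = [0.10, 0.20]       # different speaker, bona fide speech
cm_zero_effort = [0.90, 0.80]
asv_spoof = [0.75, 0.85]             # spoofed speech imitating the target
cm_spoof = [0.05, 0.10]

# ASV alone: spoofed trials score like targets, so they are hard to reject.
eer_asv_only = equal_error_rate(asv_target, asv_zero_effort + asv_spoof)

# Score-sum fusion: the low CM score pulls spoofed trials down.
fused_target = score_sum_fusion(asv_target, cm_target)
fused_nontarget = score_sum_fusion(asv_zero_effort + asv_spoof,
                                   cm_zero_effort + cm_spoof)
eer_fused = equal_error_rate(fused_target, fused_nontarget)

print(f"ASV-only EER: {eer_asv_only:.3f}, score-sum EER: {eer_fused:.3f}")
```

In this toy setting the ASV scores alone cannot separate spoofed trials from target trials, whereas the summed score can, which mirrors the qualitative trend the abstract reports.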


doi: 10.21437/Odyssey.2022-46

Cite as: Shim, H.-j., Tak, H., Liu, X., Heo, H.-S., Jung, J.-w., Chung, J.S., Chung, S.-W., Yu, H.-J., Lee, B.-J., Todisco, M., Delgado, H., Lee, K.A., Sahidullah, M., Kinnunen, T., Evans, N. (2022) Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. Proc. The Speaker and Language Recognition Workshop (Odyssey 2022), 330-337, doi: 10.21437/Odyssey.2022-46

@inproceedings{shim22_odyssey,
  author={Hye-jin Shim and Hemlata Tak and Xuechen Liu and Hee-Soo Heo and Jee-weon Jung and Joon Son Chung and Soo-Whan Chung and Ha-Jin Yu and Bong-Jin Lee and Massimiliano Todisco and Héctor Delgado and Kong Aik Lee and Md Sahidullah and Tomi Kinnunen and Nicholas Evans},
  title={{Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion}},
  year=2022,
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2022)},
  pages={330--337},
  doi={10.21437/Odyssey.2022-46}
}