An end-to-end spoofing countermeasure for automatic speaker verification using evolving recurrent neural networks

Giacomo Valenti, Héctor Delgado, Massimiliano Todisco, Nicholas Evans, Laurent Pilati


Research in anti-spoofing for automatic speaker verification has advanced considerably in the last three years. Anti-spoofing is a particularly difficult pattern classification problem since the characteristics of spoofed speech vary considerably and can never be predicted with certainty in the wild. The design of features suited to the detection of unpredictable spoofing attacks is thus a staple of current research. End-to-end approaches to spoofing detection with exploit automatic feature learning have shown success and offer obvious appeal. This paper presents our efforts to develop such a system using recurrent neural networks and a particular algorithm known as neuroevolution of augmenting topologies (NEAT). Contributions include a new fitness function for network learning that not only results in better generalisation than the baseline system, but which also improves on raw performance by 22% relative when assessed using the ASVspoof 2017 database of bona fide speech and replay spoofing attacks. Results also show that mini-batch training helps to improve generalisation, a technique which could also be of benefit to other solutions to the spoofing detection problem.


 DOI: 10.21437/Odyssey.2018-41

Cite as: Valenti, G., Delgado, H., Todisco, M., Evans, N., Pilati, L. (2018) An end-to-end spoofing countermeasure for automatic speaker verification using evolving recurrent neural networks . Proc. Odyssey 2018 The Speaker and Language Recognition Workshop, 288-295, DOI: 10.21437/Odyssey.2018-41.


@inproceedings{Valenti2018,
  author={Giacomo Valenti and Héctor Delgado and Massimiliano Todisco and Nicholas Evans and Laurent Pilati},
  title={An end-to-end spoofing countermeasure for automatic speaker verification using evolving recurrent neural networks	},
  year=2018,
  booktitle={Proc. Odyssey 2018 The Speaker and Language Recognition Workshop},
  pages={288--295},
  doi={10.21437/Odyssey.2018-41},
  url={http://dx.doi.org/10.21437/Odyssey.2018-41}
}