Modulation Dynamic Features for the Detection of Replay Attacks

Gajan Suthokumar, Vidhyasaharan Sethu, Chamith Wijenayake, Eliathamby Ambikairajah


The development of automatic systems that can detect replayed speech has emerged as a significant research challenge for securing voice biometric systems and is the focus of this paper. Specifically, this paper proposes two novel features to capture the static and dynamic characteristics of the signal from the modulation spectrum, which complement short term spectral features for use in replay detection. The modulation spectral centroid frequency feature is proposed as a vector representation of the first order spectral moments of the modulation spectrum. In conjunction to this, the long term spectral average serves to capture the static characteristics of the modulation spectrum. The proposed system, employing a GMM back-end, was evaluated on the ASVSpoof 2017 dataset and found to yield an EER of 6.54%.


 DOI: 10.21437/Interspeech.2018-1846

Cite as: Suthokumar, G., Sethu, V., Wijenayake, C., Ambikairajah, E. (2018) Modulation Dynamic Features for the Detection of Replay Attacks. Proc. Interspeech 2018, 691-695, DOI: 10.21437/Interspeech.2018-1846.


@inproceedings{Suthokumar2018,
  author={Gajan Suthokumar and Vidhyasaharan Sethu and Chamith Wijenayake and Eliathamby Ambikairajah},
  title={Modulation Dynamic Features for the Detection of Replay Attacks},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={691--695},
  doi={10.21437/Interspeech.2018-1846},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1846}
}