Detection of Replay-Spoofing Attacks Using Frequency Modulation Features

Tharshini Gunendradasan, Buddhi Wickramasinghe, Ngoc Phu Le, Eliathamby Ambikairajah, Julien Epps


Prevention of malicious spoofing attacks is currently acknowledged as a priority area of investigation for the deployment of automatic speaker verification systems. Various features of speech signals have been used to fight counterfeit attacks. Among the different spoofing attack variants, replay attacks pose a significant threat as they do not require any expert knowledge and are difficult to detect. This paper proposes the use of a spectral centroid based frequency modulation (FM) features that we term spectral centroid deviation (SCD) for replay attack detection. Spectral centroid frequency (SCF) and spectral centroid magnitude coefficient (SCMC) features extracted from the same front-end as SCD are also investigated as complementary features. Evaluations on the ASVspoof 2017 dataset indicate that the proposed SCD features with a Gaussian Mixture Model (GMM) back-end is highly capable of discriminating genuine from replay spoofed speech, providing an equal error rate improvement greater than 60% relative to the CQCC baseline system from the ASVspoof 2017 challenge. Interestingly, experiments also reveal that the proposed SCD features exhibit an increased variance for replay spoofed speech relative to genuine speech, particularly for the lowest and highest frequency subbands.


 DOI: 10.21437/Interspeech.2018-1473

Cite as: Gunendradasan, T., Wickramasinghe, B., Le, N.P., Ambikairajah, E., Epps, J. (2018) Detection of Replay-Spoofing Attacks Using Frequency Modulation Features. Proc. Interspeech 2018, 636-640, DOI: 10.21437/Interspeech.2018-1473.


@inproceedings{Gunendradasan2018,
  author={Tharshini Gunendradasan and Buddhi Wickramasinghe and Ngoc Phu Le and Eliathamby Ambikairajah and Julien Epps},
  title={Detection of Replay-Spoofing Attacks Using Frequency Modulation Features},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={636--640},
  doi={10.21437/Interspeech.2018-1473},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1473}
}