A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification

Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng


For practical automatic speaker verification (ASV) systems, replay attack poses a true risk. By replaying a pre-recorded speech signal of the genuine speaker, ASV systems tend to be easily fooled. An effective replay detection method is therefore highly desirable. In this study, we investigate a major difficulty in replay detection: the over-fitting problem caused by variability factors in speech signal. An F-ratio probing tool is proposed and three variability factors are investigated using this tool: speaker identity, speech content and playback & recording device. The analysis shows that device is the most influential factor that contributes the highest over-fitting risk. A frequency warping approach is studied to alleviate the over-fitting problem, as verified on the ASV-spoof 2017 database.


 DOI: 10.21437/Interspeech.2017-456

Cite as: Li, L., Chen, Y., Wang, D., Zheng, T.F. (2017) A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification. Proc. Interspeech 2017, 92-96, DOI: 10.21437/Interspeech.2017-456.


@inproceedings{Li2017,
  author={Lantian Li and Yixiang Chen and Dong Wang and Thomas Fang Zheng},
  title={A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={92--96},
  doi={10.21437/Interspeech.2017-456},
  url={http://dx.doi.org/10.21437/Interspeech.2017-456}
}