ISCA Archive Interspeech 2021

Adversarial Voice Conversion Against Neural Spoofing Detectors

Yi-Yang Ding, Li-Juan Liu, Yu Hu, Zhen-Hua Ling

The naturalness and similarity of voice conversion have improved significantly in recent years with the development of deep-learning-based conversion models and neural vocoders. Accordingly, the task of detecting spoofed speech has also attracted research attention. In the latest ASVspoof 2019 challenge, the best spoofing detection model could distinguish most artificial utterances from natural ones. Inspired by recent progress in adversarial example generation, this paper proposes an adversarial post-processing network (APN), which generates adversarial examples against a neural-network-based spoofing detector through a white-box attack. The APN model post-processes the speech waveforms generated by a baseline voice conversion system. An adversarial loss derived from the spoofing detector, together with two regularization losses, is applied to optimize the parameters of the APN. In our experiments on the logical access (LA) dataset of ASVspoof 2019, the proposed method effectively improves the adversarial ability of converted speech against spoofing detectors based on light convolutional neural networks (LCNNs) without degrading its subjective quality.
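
The combined objective described in the abstract (one adversarial term plus two regularization terms) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the abstract does not say what the two regularization losses are, so a waveform-distance term and an energy-difference term stand in for them, and a logistic-regression "detector" stands in for the LCNN. The names `detector_spoof_prob`, `apn_loss`, `lam_wave`, and `lam_energy` are hypothetical and do not come from the paper.

```python
import math


def detector_spoof_prob(wave, weights, bias):
    """Toy stand-in for a spoofing detector (NOT an LCNN):
    logistic regression over raw samples, returning P(spoof)."""
    z = sum(w * x for w, x in zip(weights, wave)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def apn_loss(converted, processed, weights, bias, lam_wave=1.0, lam_energy=1.0):
    """Hypothetical combined loss for a post-processing network.

    converted: waveform from the baseline voice conversion system
    processed: the same waveform after post-processing
    """
    n = len(converted)

    # Adversarial term: small when the detector scores the processed
    # waveform as natural, i.e. when P(spoof) is close to 0.
    p_spoof = detector_spoof_prob(processed, weights, bias)
    adversarial = -math.log(1.0 - p_spoof + 1e-12)

    # Regularizer 1 (assumed): mean absolute difference between waveforms,
    # which keeps the processed signal close to the converted one.
    reg_wave = sum(abs(p - c) for p, c in zip(processed, converted)) / n

    # Regularizer 2 (assumed): absolute difference in mean energy.
    energy_converted = sum(c * c for c in converted) / n
    energy_processed = sum(p * p for p in processed) / n
    reg_energy = abs(energy_processed - energy_converted)

    return adversarial + lam_wave * reg_wave + lam_energy * reg_energy


# White-box setting: the detector weights are known, so the waveform can be
# nudged directly against them to lower the detector's spoof probability.
converted = [0.5, -0.2, 0.1]
weights = [1.0, 0.5, -0.3]
bias = 0.2
processed = [x - 0.1 * w for x, w in zip(converted, weights)]
```

In this sketch the two weights `lam_wave` and `lam_energy` set the trade-off between fooling the detector and staying close to the converted speech; the paper's actual losses and weighting may differ.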

doi: 10.21437/Interspeech.2021-948

Cite as: Ding, Y.-Y., Liu, L.-J., Hu, Y., Ling, Z.-H. (2021) Adversarial Voice Conversion Against Neural Spoofing Detectors. Proc. Interspeech 2021, 816-820, doi: 10.21437/Interspeech.2021-948

  author={Yi-Yang Ding and Li-Juan Liu and Yu Hu and Zhen-Hua Ling},
  title={{Adversarial Voice Conversion Against Neural Spoofing Detectors}},
  booktitle={Proc. Interspeech 2021},