ISCA Archive SPSC 2022
ISCA Archive SPSC 2022

NWPU-ASLP System for the VoicePrivacy 2022 Challenge

Jixun Yao, Qing Wang, Li Zhang, Pengcheng Guo, Yuhao Liang, Lei Xie

This paper presents the NWPU-ASLP speaker anonymization system for VoicePrivacy 2022 Challenge. Our submission does not involve additional Automatic Speaker Verification (ASV) model or x-vector pool. Our system consists of four modules, including feature extractor, acoustic model, anonymization module, and neural vocoder. First, the feature extractor extracts the Phonetic Posteriorgram (PPG) and pitch from the input speech signal. Then, we reserve a pseudo speaker ID from a speaker look-up table (LUT), which is subsequently fed into a speaker encoder to generate the pseudo speaker embedding that is not corresponding to any real speaker. To ensure the pseudo speaker is distinguishable, we further average the randomly selected speaker embedding and weighted concatenate it with the pseudo speaker embedding to generate the anonymized speaker embedding. Finally, the acoustic model outputs the anonymized mel-spectrogram from the anonymized speaker embedding and a modified version of HifiGAN transforms the mel-spectrogram into the anonymized speech waveform. Experimental results demonstrate the effectiveness of our proposed anonymization system.

Cite as: Yao, J., Wang, Q., Zhang, L., Guo, P., Liang, Y., Xie, L. (2022) NWPU-ASLP System for the VoicePrivacy 2022 Challenge. Proc. 2nd Symposium on Security and Privacy in Speech Communication,

  author={Jixun Yao and Qing Wang and Li Zhang and Pengcheng Guo and Yuhao Liang and Lei Xie},
  title={{NWPU-ASLP System for the VoicePrivacy 2022 Challenge}},
  booktitle={Proc. 2nd Symposium on Security and Privacy in Speech Communication},