ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems

Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer


This paper introduces a new database of voice recordings with the goal of supporting research on vulnerabilities and protection of voice-controlled systems (VCSs). In contrast to prior efforts, the proposed database contains both genuine voice commands and replayed recordings of such commands, collected in realistic VCS usage scenarios and using modern voice assistant development kits. Specifically, the database contains recordings from four systems (each with a different microphone array) in a variety of environmental conditions with different forms of background noise and relative positions between speaker and device. To the best of our knowledge, this is the first publicly available database that has been specifically designed for the protection of state-of-the-art voice-controlled systems against replay attacks across a variety of conditions and environments.


DOI: 10.21437/Interspeech.2019-1541

Cite as: Gong, Y., Yang, J., Huber, J., MacKnight, M., Poellabauer, C. (2019) ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems. Proc. Interspeech 2019, 2355-2359, DOI: 10.21437/Interspeech.2019-1541.


@inproceedings{Gong2019,
  author={Yuan Gong and Jian Yang and Jacob Huber and Mitchell MacKnight and Christian Poellabauer},
  title={{ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={2355--2359},
  doi={10.21437/Interspeech.2019-1541},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1541}
}