ISCA Archive SPSC 2021
ISCA Archive SPSC 2021

Federated Learning for Privacy Preserving On-Device Speaker Recognition

Abraham Woubie, Tom Bäckström

State-of-the-art speaker recognition systems are usually trained on a single computer using speech data collected from multiple users. However, these speech samples may contain private information which users are not willing to share. To overcome such potential breaches of privacy, we investigate the use of federated learning in speaker recognition. Distributed learning methods such as federated learning enable us to train a shared model without sharing the private data by training the models on edge devices where the data resides. In the proposed system, each edge device trains an individual model which is subsequently sent to a secure aggregator. To provide contrasting data without the need for transmitting data, we use a generative adversarial network (GAN) to generate impostor data at the edge. Afterwards, the secure aggregator merges the individual models, builds a global model and transmits the global model to the edge devices through a main server. Experimental results on the Voxceleb-1 dataset show that the use of federated learning for speaker recognition system provides two advantages. Firstly, it retains privacy since the raw data does not leave the edge devices. Secondly, experimental results show that the aggregated model provides better average equal error rate than the individual models.


doi: 10.21437/SPSC.2021-1

Cite as: Woubie, A., Bäckström, T. (2021) Federated Learning for Privacy Preserving On-Device Speaker Recognition. Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication, 1-5, doi: 10.21437/SPSC.2021-1

@inproceedings{woubie21_spsc,
  author={Abraham Woubie and Tom Bäckström},
  title={{Federated Learning for Privacy Preserving On-Device Speaker Recognition}},
  year=2021,
  booktitle={Proc. 2021 ISCA Symposium on Security and Privacy in Speech Communication},
  pages={1--5},
  doi={10.21437/SPSC.2021-1}
}