Privacy-Preserving Siamese Feature Extraction for Gender Recognition versus Speaker Identification

Alexandru Nelus, Silas Rech, Timm Koppelmann, Henrik Biermann, Rainer Martin


In this paper we propose a deep neural-network-based feature extraction scheme with the purpose of reducing the privacy risks encountered in speaker classification tasks. For this we choose a challenging scenario where we wish to perform gender recognition but at the same time prevent an attacker who has intercepted the features to perform speaker identification. Our approach is to employ Siamese training in order to obtain a feature representation that minimizes the Euclidean distance between same gender speakers while maximizing it for different gender speakers. It is experimentally shown that the obtained effect is of anonymizing speakers from the same gender class and thus drastically reducing privacy risks while still permitting class discrimination with a higher accuracy than other previously investigated methods.


 DOI: 10.21437/Interspeech.2019-1148

Cite as: Nelus, A., Rech, S., Koppelmann, T., Biermann, H., Martin, R. (2019) Privacy-Preserving Siamese Feature Extraction for Gender Recognition versus Speaker Identification. Proc. Interspeech 2019, 3705-3709, DOI: 10.21437/Interspeech.2019-1148.


@inproceedings{Nelus2019,
  author={Alexandru Nelus and Silas Rech and Timm Koppelmann and Henrik Biermann and Rainer Martin},
  title={{Privacy-Preserving Siamese Feature Extraction for Gender Recognition versus Speaker Identification}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={3705--3709},
  doi={10.21437/Interspeech.2019-1148},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1148}
}