Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number

Siqi Zheng, Gang Liu, Hongbin Suo, Yun Lei


Large-scale deployment of speech interaction devices makes it possible to harvest tremendous amounts of data quickly, but it also introduces the problem of wrong labels during data mining. Mislabeled training data has a substantial negative effect on the performance of a speaker verification system. This study aims to enhance the generalization ability and robustness of the model when the training data is contaminated by wrong labels. Several regularization approaches are proposed to reduce the condition number of the speaker verification problem, making the model less sensitive to errors in the inputs. They are validated on both the NIST SRE corpus and far-field smart speaker data. The results suggest that the performance deterioration caused by mislabeled training data can be significantly ameliorated by proper regularization.
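
The link between regularization and the condition number can be illustrated with a toy example. The sketch below is not the authors' method, and the abstract does not specify their regularizers. It assumes a simple ridge (Tikhonov) term added to a 2x2 symmetric Gram matrix, and the function names and numbers are hypothetical, chosen only for illustration. A smaller condition number means that small perturbations of the inputs produce smaller changes in the solution.

```python
import math


def sym2x2_eigenvalues(a, b, d):
    """Eigenvalues of the symmetric matrix [[a, b], [b, d]], smallest first."""
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    return mean - radius, mean + radius


def condition_number(a, b, d, ridge=0.0):
    """2-norm condition number of [[a, b], [b, d]] + ridge * I.

    Assumes the resulting matrix is positive definite, so the condition
    number is the ratio of the largest to the smallest eigenvalue.
    """
    lo, hi = sym2x2_eigenvalues(a + ridge, b, d + ridge)
    return hi / lo


# Gram matrix of two nearly collinear features (illustrative values only).
a, b, d = 1.0, 0.99, 1.0

plain = condition_number(a, b, d)              # no regularization
ridged = condition_number(a, b, d, ridge=0.1)  # hypothetical ridge strength

print(f"condition number without ridge: {plain:.2f}")
print(f"condition number with ridge:    {ridged:.2f}")
```

With these illustrative values, the ridge term shifts both eigenvalues up by the same amount, so the ratio drops from roughly 199 to roughly 19. The ridge strength of 0.1 is arbitrary. In practice it trades conditioning against bias.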


DOI: 10.21437/Interspeech.2019-1442

Cite as: Zheng, S., Liu, G., Suo, H., Lei, Y. (2019) Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number. Proc. Interspeech 2019, 4065-4069, DOI: 10.21437/Interspeech.2019-1442.


@inproceedings{Zheng2019,
  author={Siqi Zheng and Gang Liu and Hongbin Suo and Yun Lei},
  title={{Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={4065--4069},
  doi={10.21437/Interspeech.2019-1442},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1442}
}