ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Improved unsupervised NAP training dataset design for speaker recognition

Hanwu Sun, Bin Ma

The Nuisance Attribute Project (NAP) with labeled data provides an effective approach for improving the speaker recognition performance in the state-of-art speaker recognition system by removing unwanted channel and handset variation. However, the requirement for the labeled NAP training data may limit its practical application. In our previous study, a simple unsupervised clustering algorithm based on dot products between supervectors was introduced for designing NAP training dataset without a prior knowledge about channel and speaker information. Using such clustering results as the initial training dataset, in this paper, we make a further improvement of the training dataset by enhancing similarity measurement of supervectors via NAP projection and score normalization. The effectiveness of this unsupervised NAP training dataset design strategy has been verified in the experiments using the in-house development dataset of IIR submission for the 2012 NIST SRE.


doi: 10.21437/Interspeech.2013-473

Cite as: Sun, H., Ma, B. (2013) Improved unsupervised NAP training dataset design for speaker recognition. Proc. Interspeech 2013, 1991-1995, doi: 10.21437/Interspeech.2013-473

@inproceedings{sun13_interspeech,
  author={Hanwu Sun and Bin Ma},
  title={{Improved unsupervised NAP training dataset design for speaker recognition}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1991--1995},
  doi={10.21437/Interspeech.2013-473}
}