ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Importance of nasality measures for speaker recognition data selection and performance prediction

Howard Lei, Eduardo Lopez-Gonzalo

We improve upon measures relating feature vector distributions to speaker recognition (SR) performances for SR performance prediction and arbitrary data selection. In particular, we examine the means and variances of 11 features pertaining to nasality (resulting in 22 measures), computing them on feature vectors of phones to determine which measures give good SR performance prediction of phones. We’ve found that the combination of nasality measures give a 0.917 correlation with the Equal Error Rates (EERs) of phones on SRE08, exceeding the correlation of our previous best measure (mutual information) by 12.7%. When implemented in our data-selection scheme (which does not require a SR system to be run), the nasality measures allow us to select data with combined EER better than data selected via running a SR system in certain cases, at a fortieth of the computational costs. The nasality measures require a tenth of the computational costs compared to our previous best measure.


doi: 10.21437/Interspeech.2009-268

Cite as: Lei, H., Lopez-Gonzalo, E. (2009) Importance of nasality measures for speaker recognition data selection and performance prediction. Proc. Interspeech 2009, 888-891, doi: 10.21437/Interspeech.2009-268

@inproceedings{lei09_interspeech,
  author={Howard Lei and Eduardo Lopez-Gonzalo},
  title={{Importance of nasality measures for speaker recognition data selection and performance prediction}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={888--891},
  doi={10.21437/Interspeech.2009-268}
}