ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Speaker recognition based on discriminative feature extraction - optimization of mel-cepstral features using second-order all-pass warping function

Chiyomi Miyajima, Hideyuki Watanabe, Tadashi Kitamura, Shigeru Katagiri

This paper describes a new framework for designing speaker recognition systems based on the discriminative feature extraction (DFE) method. We apply a mel-cepstral estimation technique to the feature extractor in a Gaussian mixture model (GMM)­based text­independent speaker identification system. The mel­cepstral estimation technique uses the second­order all­pass warping function for frequency transformation. We jointly optimize the frequency warping parameters of the feature extractor and the GMM parameters of the classifier based on a minimum classification error (MCE) criterion. Experimental results show that the frequency warped scale after optimization is different from traditional linear/mel scales; moreover, the proposed system outperforms conventional systems trained with the generalized probabilistic descent (GPD) method in which only the classifier is optimized.


doi: 10.21437/Eurospeech.1999-189

Cite as: Miyajima, C., Watanabe, H., Kitamura, T., Katagiri, S. (1999) Speaker recognition based on discriminative feature extraction - optimization of mel-cepstral features using second-order all-pass warping function. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 779-782, doi: 10.21437/Eurospeech.1999-189

@inproceedings{miyajima99_eurospeech,
  author={Chiyomi Miyajima and Hideyuki Watanabe and Tadashi Kitamura and Shigeru Katagiri},
  title={{Speaker recognition based on discriminative feature extraction - optimization of mel-cepstral features using second-order all-pass warping function}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={779--782},
  doi={10.21437/Eurospeech.1999-189}
}