Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Speaker Recognition by Means of a Combination of Linear and Nonlinear Predictive Models

Marcos Faúndez-Zanuy

Escola Universitária Politécnica de Mataró, Universitat Politécnica de Catalunya (UPC), Mataró, Barcelona, Spain

This paper deals the combination of nonlinear predictive models with classical LPCC parameterization for speaker recognition. It is shown that the combination of both a measure defined over LPCC coefficients and a measure defined over predictive analysis residual signal gives rise to an improvement over the classical method that considers only the LPCC coefficients. If the residual signal is obtained from a linear prediction analysis, the improvement is 2.63% (error rate drops from 6.31% to 3.68%) and if it is computed through a nonlinear predictive neural nets based model, the improvement is 3.68%. An efficient algorithm for reducing the computational burden is also proposed.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Faúndez-Zanuy, Marcos (1999): "Speaker recognition by means of a combination of linear and nonlinear predictive models", In EUROSPEECH'99, 763-766.