11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Extended Weighted Linear Prediction (XLP) Analysis of Speech and Its Application to Speaker Verification in Adverse Conditions

Jouni Pohjalainen (1), Rahim Saeidi (2), Tomi Kinnunen (2), Paavo Alku (1)

(1) Aalto University, Finland
(2) University of Eastern Finland, Finland

This paper introduces a generalized formulation of linear prediction (LP) that includes both conventional and temporally weighted LP analysis methods as special cases. The temporally weighted methods have recently been applied successfully to noise-robust spectrum analysis in speech and speaker recognition. Compared with those earlier methods, the new generalized approach allows more versatility in weighting different parts of the data in the LP analysis. Two such weighted methods are evaluated against the conventional spectrum modeling methods FFT and LP, as well as the temporally weighted methods WLP and SWLP, by substituting each in turn as the spectrum estimation method in the MFCC feature extraction stage of a GMM-UBM-based speaker verification system. The new methods are shown to improve performance in several cases involving channel distortion and additive noise mismatch between the training and recognition conditions.
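
As an illustration of the relationship between conventional and temporally weighted LP mentioned in the abstract, the following is a minimal sketch and not the paper's XLP formulation, which the abstract does not spell out. It assumes the standard weighted least-squares form of WLP, in which the squared prediction error at each time instant n is multiplied by a weight w[n]; with uniform weights this reduces to conventional autocorrelation-method LP. The function names weighted_lp and short_time_energy, the energy window length, and the small weight floor are hypothetical choices made for this example.

```python
import numpy as np


def weighted_lp(x, order, weights=None):
    """Predictor coefficients a[1..p] minimizing
    sum_n w[n] * (x[n] - sum_k a[k] * x[n-k])**2 for n = 0 .. N+p-1,
    where x is treated as zero outside 0 .. N-1."""
    x = np.asarray(x, dtype=float)
    total = len(x) + order
    # Zero-pad so that every delayed sample x[n-k] is defined.
    padded = np.concatenate([np.zeros(order), x, np.zeros(order)])
    # Row k holds x[n-k] for n = 0 .. total-1 (row 0 is the signal itself).
    delayed = np.stack(
        [padded[order - k: order - k + total] for k in range(order + 1)]
    )
    if weights is None:
        weights = np.ones(total)  # uniform weights: conventional LP
    weights = np.asarray(weights, dtype=float)
    # Weighted correlations C[i, k] = sum_n w[n] * x[n-i] * x[n-k].
    corr = (delayed * weights) @ delayed.T
    # Normal equations: sum_k a[k] * C[i, k] = C[i, 0] for i = 1 .. p.
    return np.linalg.solve(corr[1:, 1:], corr[1:, 0])


def short_time_energy(x, order, window=None, floor=1e-9):
    """Example weight (an assumption, not taken from the abstract):
    w[n] = sum_{i=1..window} x[n-i]**2, plus a small floor."""
    x = np.asarray(x, dtype=float)
    window = order if window is None else window
    total = len(x) + order
    # running[m] = sum_{i=0..window-1} x[m-i]**2
    running = np.convolve(x ** 2, np.ones(window))
    w = np.zeros(total)
    m = min(total - 1, len(running))
    w[1:1 + m] = running[:m]  # shift by one so only past samples count
    return w + floor


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.standard_normal(240)
    a_lp = weighted_lp(frame, order=10)
    a_wlp = weighted_lp(frame, order=10,
                        weights=short_time_energy(frame, 10))
    print(a_lp.shape, a_wlp.shape)
```

In a feature extraction pipeline of the kind the abstract describes, the all-pole spectrum derived from such coefficients would stand in for the FFT magnitude spectrum before the mel filterbank; that step is omitted here.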


Bibliographic reference.  Pohjalainen, Jouni / Saeidi, Rahim / Kinnunen, Tomi / Alku, Paavo (2010): "Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions", In INTERSPEECH-2010, 1477-1480.