Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

MLLR-Like Speaker Adaptation Based on Linearization of VTLN with MFCC Features

Xiaodong Cui, Abeer Alwan

University of California at Los Angeles, USA

This paper proposes an MLLR-like adaptation approach in which the means are transformed deterministically, based on a linearization of VTLN, while the biases and the adaptation of the variances are estimated statistically with the EM algorithm. In the discrete frequency domain, we show that, under certain approximations, frequency warping with Mel-filterbank-based MFCCs is equivalent to a linear transformation in the cepstral domain. Using this linear relationship, the transformation matrix is generated by formant-like peak alignment. Experimental results on children's speech show improvements over traditional MLLR and VTLN, even with limited amounts of adaptation data.
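The claim that frequency warping becomes a linear map on the cepstra can be illustrated with a minimal sketch. It rests on assumptions that are not taken from the paper: the warp is modeled as linear interpolation across Mel filterbank channel indices with a single hypothetical scale factor `alpha`, it is applied to the log filterbank energies, and all cepstral coefficients are kept so that the orthonormal DCT is exactly invertible. The function names are illustrative, and the paper's formant-like peak alignment is not reproduced here.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix (n x n); its inverse is its transpose."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * k * (m + 0.5) / n)
    C[0, :] /= np.sqrt(2.0)
    return C

def warp_matrix(n, alpha):
    """Interpolation matrix: output channel i reads input position alpha * i.

    Assumption: a single scale factor on the channel index stands in for
    the frequency warping function.
    """
    W = np.zeros((n, n))
    for i in range(n):
        pos = min(alpha * i, n - 1)   # clip to the last channel
        lo = int(np.floor(pos))
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        W[i, lo] += 1.0 - frac
        W[i, hi] += frac
    return W

def cepstral_transform(n, alpha):
    """Matrix A such that warped_cepstra = A @ cepstra."""
    C = dct_matrix(n)
    return C @ warp_matrix(n, alpha) @ C.T

if __name__ == "__main__":
    n = 20
    rng = np.random.default_rng(0)
    log_mel = rng.normal(size=n)      # stand-in for log filterbank energies
    C = dct_matrix(n)
    A = cepstral_transform(n, 0.9)
    # Warping in the filterbank domain matches the linear map on the cepstra.
    print(np.allclose(A @ (C @ log_mel), C @ (warp_matrix(n, 0.9) @ log_mel)))
```

In an MLLR-like setting, a matrix of this kind would play the role of the deterministic mean transformation, leaving only the biases and variances to be estimated from adaptation data. With truncated cepstra the DCT is no longer invertible, so the relationship holds only approximately.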


Bibliographic reference. Cui, Xiaodong / Alwan, Abeer (2005): "MLLR-like speaker adaptation based on linearization of VTLN with MFCC features", in INTERSPEECH-2005, 273-276.