Interspeech'2005 - Eurospeech
In this paper, an MLLR-like adaptation approach is proposed whereby the transformation of the means is performed deterministically based on linearization of VTLN. Biases and adaptation of the variances are estimated statistically by the EM algorithm. In the discrete frequency domain, we show that under certain approximations, frequency warping with Mel-filterbank-based MFCCs equals a linear transformation in the cepstral domain. Utilizing the deduced linear relationship, the transformation matrix is generated by formant-like peak alignment. Experimental results using children's speech show improvements over traditional MLLR and VTLN. The improvements occur even with limited amounts of adaptation data.
Bibliographic reference. Cui, Xiaodong / Alwan, Abeer (2005): "MLLR-like speaker adaptation based on linearization of VTLN with MFCC features", In INTERSPEECH-2005, 273-276.