Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

An Investigation into Vocal Tract Length Normalisation

L. F. Uebel, P. C. Woodland

Cambridge University Engineering Department, Cambridge, UK

This paper investigates several different methods for performing vocal tract length normalisation (VTLN) which are either completely linear or piece-wise linear. Furthermore the combination of VTLN with either standard unconstrained maximum likelihood linear regression (MLLR) or constrained MLLR is considered. Results on the Switchboard corpus show that there is little difference in performance between the different forms of VTLN, and that as previously reported that the effects of VTLN and unconstrained MLLR are largely additive. However it was found that if multiple iterations of constrained MLLR is used there is no additional advantage to also using VTLN.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Uebel, L. F. / Woodland, P. C. (1999): "An investigation into vocal tract length normalisation", In EUROSPEECH'99, 2527-2530.