In recent years, speech recognition researchers have proposed Gaussian warping as a step in the computation of input speech feature parameters. This warping is intended to reduce the mismatch between the actual statistical distribution of the parameters and the Gaussian distribution assumed by the acoustic models. In this paper, we compare variants of Gaussianization, including off-line and windowed (short-term) versions, and evaluate them on a corpus of Canadian Parliamentary Debates.
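To make the off-line versus windowed distinction concrete, the following is a minimal sketch of rank-based Gaussian warping for a single feature dimension. It is an illustration under stated assumptions, not the implementation evaluated in the paper: the function names `warp_offline` and `warp_windowed`, the `(rank + 0.5) / n` probability estimate, and the default window length are all hypothetical choices made for this example.

```python
from statistics import NormalDist

# Target distribution: standard normal (zero mean, unit variance).
_STD_NORMAL = NormalDist()


def warp_offline(values):
    """Map each value to the normal quantile of its rank over the whole sequence."""
    n = len(values)
    # Indices sorted by value; ties are broken by position (stable sort).
    order = sorted(range(n), key=lambda i: values[i])
    warped = [0.0] * n
    for rank, i in enumerate(order):
        # (rank + 0.5) / n keeps the probability strictly inside (0, 1).
        warped[i] = _STD_NORMAL.inv_cdf((rank + 0.5) / n)
    return warped


def warp_windowed(values, window=301):
    """Warp each value using its rank within a sliding window centered on it.

    The default window length is an arbitrary assumption for this sketch.
    """
    n = len(values)
    half = window // 2
    warped = []
    for t in range(n):
        # The window is truncated at the sequence boundaries.
        lo, hi = max(0, t - half), min(n, t + half + 1)
        segment = values[lo:hi]
        # Rank = number of strictly smaller neighbours (tied values share a rank).
        rank = sum(1 for v in segment if v < values[t])
        warped.append(_STD_NORMAL.inv_cdf((rank + 0.5) / len(segment)))
    return warped
```

With distinct values and a window that covers the whole sequence, the two functions give the same result; with a shorter window, each frame is warped relative to its local context only, which is what makes the windowed variant short-term.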
Cite as: Ouellet, P., Boulianne, G., Kenny, P. (2005) Flavors of Gaussian warping. Proc. Interspeech 2005, 2957-2960, doi: 10.21437/Interspeech.2005-128
@inproceedings{ouellet05_interspeech,
  author={Pierre Ouellet and Gilles Boulianne and Patrick Kenny},
  title={{Flavors of Gaussian warping}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2957--2960},
  doi={10.21437/Interspeech.2005-128}
}