This paper introduces our actual work in front-end techniques to obtain robust speech recognition devices in mismatch conditions (additive noise mismatch and channel mismatch). Two algorithms have been combined to compensate the distortions due to different channel characteristics and additive noise: 1) A Cepstral Mean Normalization and Variance Scaling technique (MNVS) and 2) An Adaptive Gaussian Attenuation algorithm (AGA). Combining both techniques the channel distortion effects were reduced to 90% on the HTIMIT task and the additive noise effects were reduced to 80% on the TIMIT task corrupted with additive car noise.
Cite as: Menéndez-Pidal, X., Chen, R., Wu, D., Tanaka, M. (1999) Front-end improvements to reduce stationary & variable channel and noise distortions in continuous speech recognition tasks. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2849-2852, doi: 10.21437/Eurospeech.1999-630
@inproceedings{menendezpidal99_eurospeech, author={Xavier Menéndez-Pidal and Ruxin Chen and Duanpei Wu and Mick Tanaka}, title={{Front-end improvements to reduce stationary & variable channel and noise distortions in continuous speech recognition tasks}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={2849--2852}, doi={10.21437/Eurospeech.1999-630} }