8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Irrelevant Variability Normalization Based HMM Training Using VTS Approximation of an Explicit Model of Environmental Distortions

Yu Hu, Qiang Huo

University of Hong Kong, China

In the traditional HMM compensation approach to robust speech recognition, which uses a Vector Taylor Series (VTS) approximation of an explicit model of environmental distortions, the set of generic HMMs is typically trained from "clean" speech only. In this paper, we present a maximum likelihood approach to training generic HMMs from both "clean" and "corrupted" speech, based on the concept of irrelevant variability normalization. Evaluation results on the Aurora2 connected-digits database demonstrate that the proposed approach achieves significant improvements in recognition accuracy compared to the traditional VTS-based HMM compensation approach.
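For readers unfamiliar with VTS-based compensation, the following is a minimal sketch of the idea, not the formulation used in the paper. It assumes the commonly used log-spectral distortion model y = x + h + log(1 + exp(n - x - h)), with independent channels, where x is clean speech, n is additive noise, and h is the channel (convolutional) distortion. A first-order Taylor expansion around the means gives compensated Gaussian parameters. The function name `vts_compensate` and all numeric values are hypothetical.

```python
import math

def vts_compensate(mu_x, var_x, mu_n, var_n, mu_h):
    """First-order VTS compensation of one diagonal Gaussian.

    Assumed model (per log-spectral channel):
        y = x + h + log(1 + exp(n - x - h))
    All arguments are equal-length lists of per-channel values.
    Returns the compensated (mean, variance) lists for y.
    """
    mu_y, var_y = [], []
    for mx, vx, mn, vn, mh in zip(mu_x, var_x, mu_n, var_n, mu_h):
        d = mn - mx - mh                      # noise level relative to speech
        # Zeroth-order term: the model evaluated at the expansion point.
        mu_y.append(mx + mh + math.log1p(math.exp(d)))
        # Jacobian dy/dx at the expansion point; dy/dn equals 1 - g.
        g = 1.0 / (1.0 + math.exp(d))
        # Linearized variance, treating x and n as independent.
        var_y.append(g * g * vx + (1.0 - g) * (1.0 - g) * vn)
    return mu_y, var_y

# Channel 0: high SNR, so the clean Gaussian is almost unchanged.
# Channel 1: speech and noise at equal level, so both contribute equally.
mu_y, var_y = vts_compensate(
    mu_x=[10.0, 2.0], var_x=[1.0, 1.0],
    mu_n=[0.0, 2.0], var_n=[0.5, 3.0],
    mu_h=[0.0, 0.0],
)
```

In this sketch, the traditional approach would apply such a compensation at recognition time to HMMs trained on clean speech; the paper's contribution concerns how the generic HMMs themselves are trained.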


Bibliographic reference. Hu, Yu / Huo, Qiang (2007): "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions", in INTERSPEECH-2007, 1042-1045.