8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Entropy based Combination of Tandem Representations for Noise Robust ASR

Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky, Hervé Bourlard

IDIAP, Switzerland

In this paper, we present an entropy based method to combine tandem representations of the recently proposed Phase AutoCorrelation (PAC) based features and Mel-Frequency Cepstral Coefficient (MFCC) features. PAC based features, which are derived from a nonlinear transformation of autocorrelation coefficients and have been shown to be noise robust, become more robust to additive noise in their tandem representation. MFCC features, on the other hand, show a significant improvement in recognition performance on clean speech in their tandem representation. The entropy based combination method investigated in this paper adaptively gives a higher weight to the MFCC representation in clean speech and to the PAC based representation in noisy speech, thus yielding robust recognition performance in all conditions.
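The abstract does not state the weighting rule, so the following is only a minimal sketch of one plausible form of entropy based combination, assuming that each feature stream yields a per-frame vector of class posteriors and that the weights are inversely proportional to the entropy of those posteriors. The function names `entropy` and `combine_inverse_entropy`, the `eps` smoothing term, and the example posterior values are hypothetical and not taken from the paper.

```python
import math


def entropy(posteriors):
    """Shannon entropy (in nats) of one frame's class-posterior vector."""
    return -sum(p * math.log(p) for p in posteriors if p > 0.0)


def combine_inverse_entropy(post_a, post_b, eps=1e-10):
    """Combine two posterior vectors for the same frame.

    Assumption: each stream's weight is proportional to the inverse of
    its posterior entropy, so the more confident (lower entropy) stream
    contributes more. eps guards against division by zero.
    """
    if len(post_a) != len(post_b):
        raise ValueError("posterior vectors must have the same length")
    inv_a = 1.0 / (entropy(post_a) + eps)
    inv_b = 1.0 / (entropy(post_b) + eps)
    w_a = inv_a / (inv_a + inv_b)
    w_b = 1.0 - w_a
    combined = [w_a * pa + w_b * pb for pa, pb in zip(post_a, post_b)]
    return combined, (w_a, w_b)


# Illustrative values only: stream A is confident, stream B is uncertain.
stream_a = [0.90, 0.05, 0.05]
stream_b = [0.40, 0.30, 0.30]
combined, weights = combine_inverse_entropy(stream_a, stream_b)
print("weights:", weights)
print("combined posteriors:", combined)
```

Under this assumption, a stream whose posteriors flatten out in a given condition (for example, MFCC posteriors in noise) has higher entropy and is automatically down-weighted frame by frame, which matches the adaptive behaviour described above.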


Bibliographic reference. Ikbal, Shajith / Misra, Hemant / Sivadas, Sunil / Hermansky, Hynek / Bourlard, Hervé (2004): "Entropy based combination of tandem representations for noise robust ASR", In INTERSPEECH-2004, 2553-2556.