ISCA Archive ISCSLP 2004
ISCA Archive ISCSLP 2004

Data-Driven Temporal Filters based on Maximum Mutual Information for Robust Features in Speech Recognition

YungSheng Huang, Jeihweih Hung

Linear Discriminant Analysis (LDA), Principal Component Analysis (PCA) and Minimum Classification Error (MCE) have been used to derive data-driven temporal filters in order to improve the robustness of speech features for speech recognition. In this paper, the criterion of Maximum Mutual Information (MMI) is proposed for constructing the temporal filters, and detailed comparative analysis among these various approaches are presented and discussed. Experimental results show that the MMI-derived temporal filters significantly improve the recognition performance of the original MFCC features as LDA/PCA/MCE-derived filters do. Also, while the MMI-derived filters are combined with the conventional temporal filters, Cepstral Mean and Variance Normalization (CMVN), the recognition performance can be further improved.


Cite as: Huang, Y., Hung, J. (2004) Data-Driven Temporal Filters based on Maximum Mutual Information for Robust Features in Speech Recognition. Proc. International Symposium on Chinese Spoken Language Processing, 105-108

@inproceedings{huang04b_iscslp,
  author={YungSheng Huang and Jeihweih Hung},
  title={{Data-Driven Temporal Filters based on Maximum Mutual Information for Robust Features in Speech Recognition}},
  year=2004,
  booktitle={Proc. International Symposium on Chinese Spoken Language Processing},
  pages={105--108}
}