ISCA Archive Interspeech 2008

Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognition

Liang-che Sun, Chang-wen Hsu, Lin-shan Lee

Previous approaches to modulation spectrum equalization were evaluated only on the Aurora 2 small-vocabulary task. We further apply these approaches to the Aurora 4 large-vocabulary task. In the spectral histogram equalization (SHE) approach, we equalize the histogram of the modulation spectrum of each utterance to a reference histogram obtained from clean training data. In the magnitude ratio equalization (MRE) approach, we equalize the magnitude ratio of lower to higher frequency components of the modulation spectrum to a reference value, also obtained from clean training data. Experimental results indicate significant performance improvements when these approaches are cascaded with cepstral mean and variance normalization (CMVN). Cascading MRE with more advanced feature normalization approaches, such as histogram equalization (HEQ) and higher-order cepstral moment normalization (HOCMN), yielded additional performance improvements.
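
As a rough illustration of the two ideas described above, the following sketch applies SHE and MRE to the time trajectory of a single feature coefficient. It is not the authors' implementation. Several details are assumptions made only for this example: the modulation spectrum is taken as the DFT magnitude of the trajectory, SHE is realized as rank-based quantile matching, MRE rescales the lower band only, and the names `she`, `mre`, `ref_sorted`, `ref_ratio`, and `cutoff` are hypothetical.

```python
import numpy as np

def modulation_spectrum(traj):
    """Magnitude and phase of the DFT of one feature trajectory (assumed definition)."""
    spec = np.fft.rfft(traj)
    return np.abs(spec), np.angle(spec)

def she(traj, ref_sorted):
    """SHE sketch: map the utterance's magnitude histogram onto a reference
    histogram by quantile matching. ref_sorted holds sorted reference
    magnitudes, assumed to come from clean training data."""
    mag, phase = modulation_spectrum(traj)
    ranks = np.argsort(np.argsort(mag))              # rank of each bin's magnitude
    quantiles = (ranks + 0.5) / len(mag)             # empirical CDF values
    ref_q = (np.arange(len(ref_sorted)) + 0.5) / len(ref_sorted)
    new_mag = np.interp(quantiles, ref_q, ref_sorted)
    # Keep the original phase and return to the time domain.
    return np.fft.irfft(new_mag * np.exp(1j * phase), n=len(traj))

def magnitude_ratio(mag, cutoff):
    """Ratio of summed magnitudes below the cutoff bin to those at or above it."""
    return mag[:cutoff].sum() / mag[cutoff:].sum()

def mre(traj, ref_ratio, cutoff):
    """MRE sketch: rescale the lower band so the low/high magnitude ratio
    equals ref_ratio. Scaling the lower band is an assumption of this example."""
    mag, phase = modulation_spectrum(traj)
    scale = ref_ratio / magnitude_ratio(mag, cutoff)
    new_mag = mag.copy()
    new_mag[:cutoff] *= scale
    return np.fft.irfft(new_mag * np.exp(1j * phase), n=len(traj))
```

In a cascade such as the one the abstract describes, a step like this would run on each coefficient trajectory after CMVN (or HEQ/HOCMN), with the reference histogram or ratio estimated once from clean training utterances.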


doi: 10.21437/Interspeech.2008-292

Cite as: Sun, L.-c., Hsu, C.-w., Lee, L.-s. (2008) Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognition. Proc. Interspeech 2008, 1004-1007, doi: 10.21437/Interspeech.2008-292

@inproceedings{sun08_interspeech,
  author={Liang-che Sun and Chang-wen Hsu and Lin-shan Lee},
  title={{Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognition}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1004--1007},
  doi={10.21437/Interspeech.2008-292}
}