ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

Speech recognition with rapid environment adaptation by spectrum equalization

Keizaburo Takagi, Hiroaki Hattori, Takao Watanabe

This paper proposes a rapid environment adaptation algorithm based on spectrum equalization (REALISE). In practical speech recognition applications, differences between training and testing environments often seriously diminish recognition accuracy. These environmental differences can be classified into two types of difference: difference in additive noise and in multiplicative noise in the spectral domain. The proposed method calculates time-alignment between a testing utterance and the closest reference pattern to it, and then calculates the noise differences between the two according to the time-alignment. Then, we adapt all reference patterns to the testing environment using the differences. Finally, the testing utterance is recognized using the adapted reference patterns. In a 250 Japanese word recognition task, in which the training and testing microphones were of two different types, REALISE improved recognition accuracy from 87% to 96%.


Cite as: Takagi, K., Hattori, H., Watanabe, T. (1994) Speech recognition with rapid environment adaptation by spectrum equalization. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 1023-1026

@inproceedings{takagi94b_icslp,
  author={Keizaburo Takagi and Hiroaki Hattori and Takao Watanabe},
  title={{Speech recognition with rapid environment adaptation by spectrum equalization}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={1023--1026}
}