ISCA Archive ICSLP 2000

The effects of room acoustics on MFCC speech parameter

Yue Pan, Alex Waibel

Automatic speech recognition systems attain high performance in close-talking applications, but their performance deteriorates significantly in distant-talking environments. The reason is the mismatch between training and testing conditions. To better understand the effects of room acoustics on speech features, we compare simultaneous recordings of close-talking and distant-talking speech utterances. We discuss the characteristics of two sources of degradation, background noise and room reverberation, whose impacts on the spectrum differ: noise affects the valleys of the spectrum, while reverberation distorts the peaks at the pitch frequency and its multiples. For the case where very little training data is available, we attempt to choose efficient compensation approaches in the spectrum, spectral subband, or cepstrum domain. A vector quantization based model is used to study how this variation influences the feature vector distribution. Results of speaker identification experiments are presented for both close-talking and distant-talking data.
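
As a rough illustration of the vector quantization based speaker identification mentioned in the abstract, the sketch below trains one codebook per speaker with plain k-means and picks the speaker whose codebook gives the lowest average distortion on the test frames. This is a generic textbook formulation, not necessarily the authors' exact model: the function names, the codebook size of 8, the iteration count, and the squared Euclidean distance are all assumptions made for illustration, and the feature matrices stand in for MFCC frames of shape (n_frames, n_dims).

```python
import numpy as np


def train_codebook(features, n_codewords=8, n_iters=20, seed=0):
    """Fit a VQ codebook to (n_frames, n_dims) features with plain k-means.

    n_codewords and n_iters are illustrative defaults (assumptions),
    not values taken from the paper.
    """
    rng = np.random.default_rng(seed)
    # Initialise codewords from randomly chosen training frames.
    idx = rng.choice(len(features), size=n_codewords, replace=False)
    codebook = np.array(features[idx], dtype=float)
    for _ in range(n_iters):
        # Squared Euclidean distance from every frame to every codeword.
        dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(n_codewords):
            members = features[nearest == k]
            if len(members) > 0:  # keep the old codeword if a cell is empty
                codebook[k] = members.mean(axis=0)
    return codebook


def average_distortion(features, codebook):
    """Mean squared distance from each frame to its nearest codeword."""
    dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return float(dists.min(axis=1).mean())


def identify_speaker(features, codebooks):
    """Return the key of the codebook with the lowest average distortion."""
    scores = {name: average_distortion(features, cb)
              for name, cb in codebooks.items()}
    return min(scores, key=scores.get)
```

With codebooks trained on close-talking features, comparing `average_distortion` on close-talking and distant-talking frames of the same speaker is one simple way to observe how a training/testing mismatch shifts the feature vector distribution.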


Cite as: Pan, Y., Waibel, A. (2000) The effects of room acoustics on MFCC speech parameter. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 129-132

@inproceedings{pan00c_icslp,
  author={Yue Pan and Alex Waibel},
  title={{The effects of room acoustics on MFCC speech parameter}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={4},
  pages={129--132}
}