ISCA Archive Eurospeech 1991

Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)

Hynek Hermansky, Nelson Morgan, Aruna Bayya, Phil Kohn

Human perception is generally insensitive to steady-state stimuli, responding largely to relative values of the input stimulus. In spite of that, most speech parameter estimation techniques use absolute spectral values. Consequently, they fail when the spectral values are modified by the frequency response of the communication channel. In response to this challenge, we have developed an analysis technique that is more robust to steady-state factors in speech. The approach appears to work well with realistic speech and channel deformations, and is conceptually simple and computationally efficient. We have conducted an experiment on recognition of telephone-quality digits with a distorted channel. The new method is described, and experimental results for realistic channel distortion are reported, showing order-of-magnitude improvements in error rate.
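As a rough illustration of why absolute spectral values are fragile, the sketch below assumes the channel acts as a fixed gain in each frequency band. In the log-spectral domain that gain becomes a constant offset per band, so any processing that discards the steady-state component along time cancels it. Per-band mean subtraction is used here only as a simple stand-in; it is not the RASTA-PLP processing described in the paper. The data, band count, and channel gains are hypothetical.

```python
import math
import random

def log_band_energies(power_frames, channel_gain):
    """Apply a fixed per-band channel gain to band energies, then take logs."""
    return [[math.log(g * p) for g, p in zip(channel_gain, frame)]
            for frame in power_frames]

def remove_steady_state(log_frames):
    """Subtract each band's mean over time (a stand-in for steady-state removal)."""
    n_frames = len(log_frames)
    n_bands = len(log_frames[0])
    means = [sum(f[b] for f in log_frames) / n_frames for b in range(n_bands)]
    return [[f[b] - means[b] for b in range(n_bands)] for f in log_frames]

def max_abs_diff(x, y):
    """Largest absolute difference between two frame sequences."""
    return max(abs(u - v) for fx, fy in zip(x, y) for u, v in zip(fx, fy))

random.seed(0)
# Hypothetical data: 50 frames of positive energies in 4 bands.
clean = [[random.uniform(0.1, 10.0) for _ in range(4)] for _ in range(50)]
flat_channel = [1.0, 1.0, 1.0, 1.0]
tilted_channel = [0.05, 0.5, 2.0, 8.0]  # assumed fixed channel response

raw_a = log_band_energies(clean, flat_channel)
raw_b = log_band_energies(clean, tilted_channel)
comp_a = remove_steady_state(raw_a)
comp_b = remove_steady_state(raw_b)

# Absolute log spectra differ by the channel offset; compensated ones match.
print("mismatch before compensation:", max_abs_diff(raw_a, raw_b))
print("mismatch after compensation: ", max_abs_diff(comp_a, comp_b))
```

The mismatch before compensation equals the largest log channel gain in magnitude, while the mismatch after compensation is at the level of floating-point rounding.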


doi: 10.21437/Eurospeech.1991-312

Cite as: Hermansky, H., Morgan, N., Bayya, A., Kohn, P. (1991) Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), 1367-1370, doi: 10.21437/Eurospeech.1991-312

@inproceedings{hermansky91b_eurospeech,
  author={Hynek Hermansky and Nelson Morgan and Aruna Bayya and Phil Kohn},
  title={{Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)}},
  year=1991,
  booktitle={Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991)},
  pages={1367--1370},
  doi={10.21437/Eurospeech.1991-312}
}