ISCA Archive Eurospeech 1991

A comparative study of parameters and distances for noisy speech recognition

Javier Hernando, Climent Nadeu

Speech recognition in noisy environments remains an unsolved problem, even for isolated word recognition with small vocabularies. Several techniques have recently been proposed to alleviate this problem. In particular, the Short-Time Modified Coherence (SMC) parameterization and the Cepstral Projection Distortion (CPD) measure have shown excellent results when tested in a speech recognition system based on Dynamic Time Warping (DTW) using speech contaminated by additive white noise. In this paper, a new technique based on AR modeling of the one-sided autocorrelation sequence (OSALPC) is presented. A comparative study of these LPC-based techniques in the Hidden Markov Model (HMM) approach leads to two main conclusions: 1) the slope cepstral window and a relatively high model order are preferable, and 2) the cepstral representation based on modeling the autocorrelation (rather than the signal) achieves excellent results.
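
As a rough illustration of the idea named in the abstract, the following sketch computes cepstral coefficients from an AR model fitted to the one-sided autocorrelation sequence of a frame rather than to the frame itself, and then applies a slope (linearly increasing) cepstral window. It is only one plausible reading of the abstract, not the authors' implementation: the function names, the biased autocorrelation estimate, the Levinson-Durbin solver, and the parameters `n_lags`, `order`, and `n_ceps` are all assumptions introduced here.

```python
import numpy as np


def autocorrelation(x, max_lag):
    """Biased autocorrelation estimate for lags 0..max_lag."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    return np.array([np.dot(x[:n - k], x[k:]) / n for k in range(max_lag + 1)])


def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns (a, err) with a[0] == 1."""
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def lpc_to_cepstrum(a, n_ceps):
    """Cepstrum c[1..n_ceps] of the all-pole model 1/A(z), A(z) = 1 + sum a_k z^-k."""
    p = len(a) - 1
    c = np.zeros(n_ceps + 1)
    for n in range(1, n_ceps + 1):
        acc = sum(k * c[k] * a[n - k] for k in range(1, n) if n - k <= p)
        c[n] = (-a[n] if n <= p else 0.0) - acc / n
    return c[1:]


def osalpc_slope_cepstrum(frame, n_lags, order, n_ceps):
    """Hypothetical helper: LPC cepstrum of the one-sided autocorrelation, slope-weighted."""
    r_one_sided = autocorrelation(frame, n_lags)   # lags 0..n_lags of the frame
    rr = autocorrelation(r_one_sided, order)       # treat that sequence as the "signal"
    a, _ = levinson_durbin(rr, order)
    c = lpc_to_cepstrum(a, n_ceps)
    return np.arange(1, n_ceps + 1) * c            # slope window: weight n on c[n]
```

Calling `osalpc_slope_cepstrum(frame, n_lags=64, order=12, n_ceps=12)` on a windowed speech frame would yield one feature vector per frame under these assumptions; the specific values are placeholders, not settings reported in the paper.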


doi: 10.21437/Eurospeech.1991-19

Cite as: Hernando, J., Nadeu, C. (1991) A comparative study of parameters and distances for noisy speech recognition. Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), 91-94, doi: 10.21437/Eurospeech.1991-19

@inproceedings{hernando91_eurospeech,
  author={Javier Hernando and Climent Nadeu},
  title={{A comparative study of parameters and distances for noisy speech recognition}},
  year=1991,
  booktitle={Proc. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991)},
  pages={91--94},
  doi={10.21437/Eurospeech.1991-19}
}