ISCA Archive ICSLP 1994

A comparison of three noisy speech recognition approaches

Olivier Siohan, Yifan Gong, Jean-Paul Haton

We compare three recent approaches to speech recognition in noisy environments. The first approach is based on stochastic model combination of noise and speech. Given a clean speech model based on speech trajectories and an HMM noise model, this method aims to derive a noisy speech model with which noisy speech can be recognised. In the second approach, we perform a mapping between the noisy and the clean speech spaces. The noisy speech is mapped to the clean space and then recognised using clean speech models. In the last approach, linear discriminant analysis (LDA) is used as a preprocessing step, and the training and testing environmental conditions are identical. On an isolated-word recognition task with 206 words in different noisy environments, LDA gave the best results. Model combination proved effective at high SNR, but its performance dropped at low SNR. The mapping approach proved very robust, but gave the lowest recognition rate at high SNR.


Cite as: Siohan, O., Gong, Y., Haton, J.-P. (1994) A comparison of three noisy speech recognition approaches. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 1031-1034

@inproceedings{siohan94_icslp,
  author={Olivier Siohan and Yifan Gong and Jean-Paul Haton},
  title={{A comparison of three noisy speech recognition approaches}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={1031--1034}
}