ISCA Archive SPAC 1992
ISCA Archive SPAC 1992

Speech enhancement using a statistically derived filter mapping

Yan Ming Cheng, Douglas O'Shaughnessy, Peter Kabal

We view the speech enhancement task in two aspects: reduction of the perceptual noise level in degraded speech and reconstruction of the degraded information, which may result in improvement of speech intelligibility. We are also very interested in noise-independent speech enhancement where test noise environments could differ in intensity from those of algorithm development. To this end, we have developed in this paper an algorithm called Noise-Independent Statistical Spectral Mapping (NISSM) to estimate a speech enhancement Wiener filter. NISSM consists of a noise-resist ant transformation, which converts noisy speech to a set of noise-resist ant features, and a spectral mapping function, which maps the features to autoregressive spectra of clean speech. We will show that the proposed algorithm effectively reduces noise intensity. When the noise intensity of training differs from that of testing, NISSM outperforms significantly a conventional spectral mapping. The algorithm operates frame-by-frame and is designed for real-time application. The noise interference could be stationary or non-stationary white noise with variable intensity.

Cite as: Cheng, Y.M., O'Shaughnessy, D., Kabal, P. (1992) Speech enhancement using a statistically derived filter mapping. Proc. ETRW on Speech Processing in Adverse Conditions, 127-130

  author={Yan Ming Cheng and Douglas O'Shaughnessy and Peter Kabal},
  title={{Speech enhancement using a statistically derived filter mapping}},
  booktitle={Proc. ETRW on Speech Processing in Adverse Conditions},