Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Speech Enhancement Based on a Constrained Sinusoidal Model

Jesper Jensen (1), John H. L. Hansen (2)

(1) Center for PersonKommunikation (CPK), Aalborg University, Denmark
(2) Robust Speech Processing Lab., Center for Spoken Language Research, University of Colorado at Boulder, CO, USA

In this study we propose an algorithm for enhancement of speech degraded by additive broad-band noise. The algorithm represents speech using a sinusoidal model, where model parameters are estimated iteratively. In order to ensure speech-like characteristics observed in clean speech, the model parameters are restricted to satisfy certain smoothness constraints. The algorithm is evaluated using speech signals degraded by additive white Gaussian noise. Results from both objective and subjective evaluations show considerable improvement over traditional spectral subtraction and Wiener filtering based schemes. In particular, in a subjective AB preference test, where enhanced signals were encoded/decoded with the G729 speech codec, the proposed scheme was preferred over the traditional schemes in more than 5 out of 6 cases for input SNRs ranging from 5-20 dB.

Full Paper

Bibliographic reference.  Jensen, Jesper / Hansen, John H. L. (2000): "Speech enhancement based on a constrained sinusoidal model", In ICSLP-2000, vol.2, 214-217.