First European Conference on Speech Communication and Technology

Paris, France
September 27-29, 1989

Estimation of Formants in Noise Corrupted Speech Using Auditory Models

F. K. Fink, Paul Dalsgaard

Speech Technology Centre, Institute of Electronic Systems, University of Aalborg, Aalborg, Denmark

In many practical speech recognition applications it is of great importance to be able to suppress noise in the preprocessing stage. In this paper we describe three front-end processing systems, one based on speech production modelling and two based on auditory modelling, and present results on their formant estimation abilities when excited by speech signals contaminated by noise. Three noise types (car, cocktail-party and open-plan office noise) are added to speech signals at signal-to-noise ratios (SNR) varying between 20 and -10 dB. The results show that preprocessing using auditory modelling is much more robust to noise than speech production modelling, and that formants can still be reliably estimated at SNR = -5 dB for speech signals contaminated by car noise.
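The abstract does not say how the noise was scaled before being added to the speech. As an illustration only, the sketch below shows one common way to mix a noise recording into a speech signal at a target SNR, assuming SNR is defined as the ratio of average speech power to average noise power over the whole signal. The function names `mix_at_snr` and `measured_snr_db` and the toy signals are hypothetical and are not taken from the paper.

```python
import math
import random


def mix_at_snr(speech, noise, snr_db):
    """Return speech + gain * noise, with gain chosen to hit snr_db.

    Assumption: SNR = 10 * log10(mean speech power / mean noise power),
    measured over the full length of both signals.
    """
    if not speech or len(speech) != len(noise):
        raise ValueError("speech and noise must be non-empty and equally long")
    p_speech = sum(s * s for s in speech) / len(speech)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_noise == 0.0:
        raise ValueError("noise signal has zero power")
    # Solve p_speech / (gain**2 * p_noise) = 10**(snr_db / 10) for gain.
    gain = math.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(speech, noisy):
    """Recover the SNR in dB from the clean signal and the mixture."""
    p_speech = sum(s * s for s in speech)
    p_resid = sum((y - s) ** 2 for s, y in zip(speech, noisy))
    return 10.0 * math.log10(p_speech / p_resid)


# Toy stand-ins (not real recordings): a 500 Hz tone sampled at 8 kHz
# in place of speech, and seeded Gaussian samples in place of car noise.
rng = random.Random(0)
speech = [math.sin(2.0 * math.pi * 500.0 * t / 8000.0) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

# The SNR range quoted in the abstract runs from 20 dB down to -10 dB.
mixtures = {snr: mix_at_snr(speech, noise, snr) for snr in (20, 10, 0, -5, -10)}
```

Each entry of `mixtures` could then be passed to a front end under test. Other conventions, such as measuring power only over speech-active segments, would give different gains for the same nominal SNR.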

Bibliographic reference.  Fink, F. K. / Dalsgaard, Paul (1989): "Estimation of formants in noise corrupted speech using auditory models", In EUROSPEECH-1989, 2677-2680.