8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Enhancement of Reverberant Speech using Excitation Source Information

M. Chaitanya, S. R. M. Prasanna, Bayya Yegnanarayana

Indian Instiute of Technology Madras, India

This paper proposes a method for the enhancement of reverberant speech using the knowledge of the excitation source of speech production. The degradation level in the reverberant speech is measured in terms of Speech-to Reverberation component Ratio (SRR). From perception and processing point of view high SRR regions are important. Hence the proposed method identifies and enhances the speech in high SRR regions. The high SRR regions are identified using the Hilbert envelope of the Linear Prediction (LP) residual, which contains information about the excitation source of speech production. The Hilbert envelope of the LP residual derived from the reverberant speech is processed by the covariance analysis to derive the weight function. The LP residual of the reverberant speech is multiplied with the weight function to enhance the excitations of speech in the high SRR regions. The speech signal synthesized from the modified LP residual is found to be less reverberant.

Full Paper

Bibliographic reference.  Chaitanya, M. / Prasanna, S. R. M. / Yegnanarayana, Bayya (2004): "Enhancement of reverberant speech using excitation source information", In INTERSPEECH-2004, 2645-2648.