INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Speech-in-Noise Intelligibility Improvement Based on Spectral Shaping and Dynamic Range Compression

Tudor-Catalin Zorila, Varvara Kandia, Yannis Stylianou

(1) Telecommunication Department, Politehnica University of Bucharest (UPB), Romania
(2) ICS-FORTH and Computer Science Department, University of Crete, Heraklion, Crete, Greece

In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before being presented in a noisy environment, under the constraint of equal global signal power before and after modification. Two systems are combined in cascade to enhance the quality of the signal, first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech-shaped noise (SSN) and competing-speaker (CS) noise at various low SNR values show that the suggested approach outperforms state-of-the-art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain, there is an improvement of 4 dB (SSN) and 8 dB (CS). A large formal listening test confirms the efficiency of the suggested system in enhancing speech intelligibility in noise.
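
The cascade structure and the equal-power constraint described above can be illustrated with a minimal sketch. This is not the authors' system: the first-order pre-emphasis filter and the instantaneous power-law compressor below are simple stand-ins chosen for illustration, and all function names and parameter values (`pre_emphasis`, `compress`, `match_power`, `alpha`, `exponent`) are assumptions. Only the overall shape follows the abstract: modify in frequency, then in time, then rescale so that the global signal power is unchanged.

```python
import math


def mean_power(x):
    """Mean squared amplitude of a signal given as a list of floats."""
    return sum(s * s for s in x) / len(x)


def pre_emphasis(x, alpha=0.95):
    """Stand-in for spectral shaping (assumption): first-order high-frequency boost."""
    return [x[0]] + [x[n] - alpha * x[n - 1] for n in range(1, len(x))]


def compress(x, exponent=0.5):
    """Stand-in for dynamic range compression (assumption):
    instantaneous power law applied to the magnitude, sign preserved."""
    return [math.copysign(abs(s) ** exponent, s) for s in x]


def match_power(y, reference):
    """Rescale y so that its global mean power equals that of reference."""
    p_y = mean_power(y)
    if p_y == 0.0:
        return list(y)  # silent signal: nothing to rescale
    gain = math.sqrt(mean_power(reference) / p_y)
    return [gain * s for s in y]


def enhance(x):
    """Cascade: frequency-domain stand-in, then time-domain stand-in,
    then the equal-global-power constraint."""
    shaped = pre_emphasis(x)
    compressed = compress(shaped)
    return match_power(compressed, x)


# Toy input: a 200 Hz tone with slow amplitude modulation, 1 second at 16 kHz.
fs = 16000
x = [
    0.5 * math.sin(2 * math.pi * 200 * n / fs)
    * (1 + 0.8 * math.sin(2 * math.pi * 4 * n / fs))
    for n in range(fs)
]
y = enhance(x)
```

After `enhance`, `mean_power(y)` equals `mean_power(x)` up to floating-point error, so any intelligibility change in such a scheme comes from redistributing energy rather than from raising the overall level.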

Index Terms: speech-in-noise enhancement, speech intelligibility, spectral shaping, dynamic range compression

Bibliographic reference.  Zorila, Tudor-Catalin / Kandia, Varvara / Stylianou, Yannis (2012): "Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression", In INTERSPEECH-2012, 635-638.