In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before presented in a noisy environment, under the constraint of equal global signal power before and after modifications. Two systems are combined in a cascade form to enhance the quality of the signal first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech shaped (SSN) and competed speaker (CS) types of noise at various low SNR values, show that the suggested approach outperforms state-of-the art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain there is an improvement of 4 dB (SSN) and 8 dB (CS). A large formal listening test confirm the efficiency of the suggested system in enhancing speech intelligibility in noise.
Index Terms: speech-in-noise enhancement, speech intelligibility, spectral shaping, dynamic range compression
Bibliographic reference. Zorila, Tudor-Catalin / Kandia, Varvara / Stylianou, Yannis (2012): "Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression", In INTERSPEECH-2012, 635-638.