11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Energy Reallocation Strategies for Speech Enhancement in Known Noise Conditions

Yan Tang, Martin Cooke

Universidad del Pais Vasco, Spain

Speech output, whether live, recorded or synthetic, is often employed in difficult listening conditions. Context-sensitive speech modifications aim to promote intelligibility while maintaining quality and listener comfort. The current study used objective measures of intelligibility and quality to compare five energy reallocation strategies operating under equal energy and preserved duration constraints. Results in both stationary and highly-nonstationary backgrounds suggest that time-varying modifications lead to large increases in objective intelligibility, but that speech quality is best preserved by time-invariant modifications. Selective amplification of time-frequency regions with low a priori SNR produced the highest objective intelligibility without severe disruption to quality.

Full Paper

Bibliographic reference.  Tang, Yan / Cooke, Martin (2010): "Energy reallocation strategies for speech enhancement in known noise conditions", In INTERSPEECH-2010, 1636-1639.