9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Psychoacoustically-Motivated Adaptive β-Order Generalized Spectral Subtraction Based on Data-Driven Optimization

Junfeng Li (1), Hui Jiang (2), Masato Akagi (1)

(1) JAIST, Japan; (2) York University, Canada

To mitigate the performance limitations caused by the constant spectral order β in the traditional spectral subtraction methods, we previously presented an adaptive β-order generalized spectral subtraction (GSS) in which the spectral order β is updated in a heuristic way. In this paper, we propose a psychoacousticallymotivated adaptive β-order GSS, by considering that different frequency bands contribute different amounts to speech intelligibility (i.e., the band-importance function). Specifically, in this proposed adaptive β-order GSS, the tendency of spectral order β to change with the input local signal-to-noise ratio (SNR) is quantitatively approximated by a sigmoid function, which is derived through a data-driven optimization procedure by minimizing the intelligibility-weighted distance between the desired speech spectrum and its estimate. The inherent parameters of the sigmoid function are further optimized with the data-driven optimization procedure. Experimental results indicate that the proposed psychoacoustically-motivated adaptive β-order GSS yields great improvements over the traditional spectral subtraction methods with the intelligibility-weighted measures.

Full Paper

Bibliographic reference.  Li, Junfeng / Jiang, Hui / Akagi, Masato (2008): "Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization", In INTERSPEECH-2008, 171-174.