15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Enhancing Audio Source Separability Using Spectro-Temporal Regularization with NMF

Colin Vaz (1), Dimitrios Dimitriadis (2), Shrikanth S. Narayanan (1)

(1) University of Southern California, USA
(2) AT&T Labs Research, USA

We propose a spectro-temporal regularization approach for non-negative matrix factorization (NMF) that accounts for a source's spectral variability over time. The regularization terms allow NMF to adapt the spectral basis matrices optimally, reducing the mismatch between the spectral characteristics of sources observed during training and those encountered during separation. We first tested our algorithm on a simulated source separation task. Preliminary results show significant improvement in signal-to-artifacts ratio (SAR), signal-to-distortion ratio (SDR), and signal-to-interference ratio (SIR) over some current NMF methods. We also tested our algorithm on a speech enhancement task and observed a modest improvement in the PESQ (Perceptual Evaluation of Speech Quality) scores of the recovered speech.
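
For orientation, the following is a minimal sketch of the conventional supervised NMF separation pipeline that the abstract takes as its starting point. It learns a spectral basis matrix per source during training, then holds the bases fixed and estimates activations for the mixture. It is not the authors' regularized algorithm: the abstract does not specify the regularization terms, so none are included. The function names `nmf` and `separate`, the Euclidean cost, and the soft-mask reconstruction are assumptions made for illustration.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def nmf(V, rank, n_iter=200, W=None, seed=0):
    """Factor a non-negative matrix V ~= W @ H using multiplicative updates
    (Euclidean cost). If W is supplied, it is held fixed, only H is
    estimated, and `rank` is ignored."""
    rng = np.random.default_rng(seed)
    update_W = W is None
    if update_W:
        W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        if update_W:
            W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def separate(V_mix, W_a, W_b, n_iter=200):
    """Split a mixture magnitude spectrogram using fixed, pre-trained bases.
    Hypothetical helper: returns soft-masked estimates of the two sources."""
    W = np.hstack([W_a, W_b])
    _, H = nmf(V_mix, W.shape[1], n_iter=n_iter, W=W)
    k = W_a.shape[1]
    est_a = W_a @ H[:k]   # contribution explained by source A's bases
    est_b = W_b @ H[k:]   # contribution explained by source B's bases
    total = est_a + est_b + EPS
    return V_mix * est_a / total, V_mix * est_b / total
```

In this baseline the bases stay exactly as learned in training, so any change in a source's spectral characteristics at separation time goes unmodeled. That is the train/test mismatch the proposed regularization is meant to reduce.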

Bibliographic reference. Vaz, Colin / Dimitriadis, Dimitrios / Narayanan, Shrikanth S. (2014): "Enhancing audio source separability using spectro-temporal regularization with NMF", in INTERSPEECH-2014, 855-859.