INTERSPEECH 2010
11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Convexity and Fast Speech Extraction by Split Bregman Method

Meng Yu (1), Wenye Ma (2), Jack Xin (1), Stanley Osher (2)

(1) University of California at Irvine, USA
(2) University of California at Los Angeles, USA

A fast speech extraction (FSE) method is presented using convex optimization made possible by pause detection of the speech sources. Sparse unmixing filters are sought by L1 regularization and the split Bregman method. A subdivided split Bregman method is developed for efficiently estimating long reverberations in real room recordings. The speech pause detection is based on a binary mask source separation method. The FSE method is evaluated and found to outperform existing blind speech separation approaches on both synthetic and room recorded data in terms of the overall computational speed and separation quality.

Full Paper

Bibliographic reference.  Yu, Meng / Ma, Wenye / Xin, Jack / Osher, Stanley (2010): "Convexity and fast speech extraction by split bregman method", In INTERSPEECH-2010, 398-401.