12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Underdetermined Blind Source Separation with Fuzzy Clustering for Arbitrarily Arranged Sensors

Ingrid Jafari (1), Serajul Haque (1), Roberto Togneri (1), Sven Nordholm (2)

(1) University of Western Australia, Australia
(2) Curtin University, Australia

Recently, the concept of time-frequency masking has developed as an important approach to the blind source separation problem, particularly when in the presence of reverberation. However, previous research has been limited by factors such as the sensor arrangement and/or the mask estimation technique implemented. This paper presents a novel integration of two established approaches to BSS in an effort to overcome such limitations. A multidimensional feature vector is extracted from a non-linear sensor arrangement, and the fuzzy c-means algorithm is then applied to cluster the feature vectors into representations of the source speakers. Fuzzy time-frequency masks are estimated and applied to the observations for source recovery. The evaluations on the proposed study demonstrated improved separation quality over all test conditions. This establishes the potential of multidimensional fuzzy c-means clustering for mask estimation in the context of blind source separation.

Full Paper

Bibliographic reference.  Jafari, Ingrid / Haque, Serajul / Togneri, Roberto / Nordholm, Sven (2011): "Underdetermined blind source separation with fuzzy clustering for arbitrarily arranged sensors", In INTERSPEECH-2011, 1753-1756.