INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Source Separation Based on Binaural Cues and Source Model Constraints

Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis

Columbia University, USA

We describe a system for separating multiple sources from a two-channel recording based on interaural cues and known characteristics of the source signals. We combine a probabilistic model of the observed interaural level and phase differences with a prior model of the source statistics and derive an EM algorithm for finding the maximum likelihood parameters of the joint model. The system is able to separate more sound sources than there are observed channels. In simulated reverberant mixtures of three speakers the proposed algorithm gives a signal-to-noise ratio improvement of 2.1 dB over a baseline algorithm using only interaural cues.

Full Paper

Bibliographic reference.  Weiss, Ron J. / Mandel, Michael I. / Ellis, Daniel P. W. (2008): "Source separation based on binaural cues and source model constraints", In INTERSPEECH-2008, 419-422.