We describe a system for separating multiple sources from a two-channel recording based on interaural cues and known characteristics of the source signals. We combine a probabilistic model of the observed interaural level and phase differences with a prior model of the source statistics and derive an EM algorithm for finding the maximum likelihood parameters of the joint model. The system is able to separate more sound sources than there are observed channels. In simulated reverberant mixtures of three speakers the proposed algorithm gives a signal-to-noise ratio improvement of 2.1 dB over a baseline algorithm using only interaural cues.
Bibliographic reference. Weiss, Ron J. / Mandel, Michael I. / Ellis, Daniel P. W. (2008): "Source separation based on binaural cues and source model constraints", In INTERSPEECH-2008, 419-422.