ISCA Archive Interspeech 2008

Source separation based on binaural cues and source model constraints

Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis

We describe a system for separating multiple sources from a two-channel recording based on interaural cues and known characteristics of the source signals. We combine a probabilistic model of the observed interaural level and phase differences with a prior model of the source statistics and derive an EM algorithm for finding the maximum likelihood parameters of the joint model. The system is able to separate more sound sources than there are observed channels. In simulated reverberant mixtures of three speakers, the proposed algorithm gives a signal-to-noise ratio improvement of 2.1 dB over a baseline algorithm using only interaural cues.
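
To make the two interaural cues named in the abstract concrete, the following is a minimal sketch of how an interaural level difference (ILD) and interaural phase difference (IPD) could be computed per time-frequency cell from a two-channel signal. It is not the authors' implementation and does not include the probabilistic model or the EM algorithm. The function names `stft` and `interaural_cues`, the parameters `n_fft`, `hop`, and `eps`, and the left-over-right sign convention are all assumptions made for illustration.

```python
import numpy as np


def stft(x, n_fft=256, hop=128):
    """Hann-windowed short-time Fourier transform.

    Returns a complex array of shape (frames, n_fft // 2 + 1).
    Frame length and hop size are illustrative choices, not taken from the paper.
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def interaural_cues(left, right, eps=1e-12):
    """Per-cell interaural level difference (dB) and phase difference (radians).

    Assumed convention: both cues are left relative to right.
    """
    L = stft(left)
    R = stft(right)
    # Level difference: ratio of magnitudes in decibels (eps avoids log of zero).
    ild_db = 20.0 * np.log10((np.abs(L) + eps) / (np.abs(R) + eps))
    # Phase difference: angle of the cross-spectrum, wrapped to (-pi, pi].
    ipd = np.angle(L * np.conj(R))
    return ild_db, ipd


# Toy check: the left channel is the right channel scaled by 2, so the level
# difference should be about 6 dB in every cell and the phase difference zero.
rng = np.random.default_rng(0)
right = rng.standard_normal(2048)
left = 2.0 * right
ild_db, ipd = interaural_cues(left, right)
```

In a separation system of the kind the abstract describes, cues like these would be the observations that a probabilistic model assigns to sources; the sketch stops at computing them.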


doi: 10.21437/Interspeech.2008-51

Cite as: Weiss, R.J., Mandel, M.I., Ellis, D.P.W. (2008) Source separation based on binaural cues and source model constraints. Proc. Interspeech 2008, 419-422, doi: 10.21437/Interspeech.2008-51

@inproceedings{weiss08b_interspeech,
  author={Ron J. Weiss and Michael I. Mandel and Daniel P. W. Ellis},
  title={{Source separation based on binaural cues and source model constraints}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={419--422},
  doi={10.21437/Interspeech.2008-51}
}