8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

A Minimum Mean Squared Error Estimator for Single Channel Speaker Separation

Aarthi M. Reddy, Bhiksha Raj

Mitsubishi Electric Research Labs, USA

The problem of separating the signals of multiple speakers from a single mixed recording has received considerable attention in recent times. Most current techniques are based on the principle of masking: in order to separate out the signal for any speaker, frequency components that are not believed to belong to that speaker are suppressed. The signals for the various speakers are then reconstructed from the partial spectral information that remains. In this paper we present a different kind of technique: one that attempts to estimate all spectral components for the desired speaker. Separated signals are derived from the complete spectral descriptions so obtained. Experiments show that this method results in better reconstruction than masking-based reconstruction.
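The masking principle described in the abstract can be illustrated with a minimal sketch. This is not the authors' method or code: the function names, the toy magnitude values, the oracle mask (built from the known individual spectra) and the assumption that magnitudes add in the mixture are all hypothetical simplifications chosen for illustration.

```python
# Illustrative sketch only: an oracle binary mask on one frame of toy spectra.
# All names and values are hypothetical; this is not the paper's estimator.

def binary_mask(target_mag, interferer_mag):
    """Return 1.0 where the target speaker dominates a frequency bin, else 0.0."""
    return [1.0 if t >= i else 0.0 for t, i in zip(target_mag, interferer_mag)]

def apply_mask(mixture_mag, mask):
    """Suppress (zero out) the bins attributed to the other speaker."""
    return [m * x for m, x in zip(mask, mixture_mag)]

# Toy magnitude spectra for a single frame (hypothetical values).
speaker_a = [4.0, 0.5, 3.0, 0.2]
speaker_b = [1.0, 2.0, 0.5, 2.5]

# Simplifying assumption: magnitudes add in the mixed recording.
mixture = [a + b for a, b in zip(speaker_a, speaker_b)]

mask_a = binary_mask(speaker_a, speaker_b)
estimate_a = apply_mask(mixture, mask_a)
```

In this toy frame, the bins where speaker B dominates are set to zero in `estimate_a`, so only partial spectral information survives for speaker A. The technique presented in the paper instead attempts to estimate every spectral component of the desired speaker; the abstract does not give the details of that estimator, so it is not sketched here.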


Bibliographic reference. Reddy, Aarthi M. / Raj, Bhiksha (2004): "A minimum mean squared error estimator for single channel speaker separation", in INTERSPEECH-2004, 2445-2448.