INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Modulation Domain Blind Source Separation for Noisy Speech Mixture

Yi Zhang, Yunxin Zhao

Department of Computer Science University of Missouri, Columbia, MO, USA

In this paper, we propose a noise-robust blind speech separation (BSS) method by using two microphones. We first use modulation domain real and imaginary spectral subtraction (MRISS) to enhance both magnitude and phase spectra of the speech mixture inputs. We then estimate the direction of arrivals (DOAs) of the speech sources and perform time-acoustic-modulation frequency masking to recover the source signals. Our experimental results in five types of noise conditions have showed the superior performance of the proposed method in comparison with the conventional acoustic-domain DOA based separation method.

Index Terms: time-frequency masking, direction of arrival, modulation frequency, blind speech separation

Full Paper

Bibliographic reference.  Zhang, Yi / Zhao, Yunxin (2012): "Modulation domain blind source separation for noisy speech mixture", In INTERSPEECH-2012, 1544-1547.