EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Blind Speech Separation of Moving Speakers Using Hybrid Neural Networks

Athanasios Koutras, Evangelos Dermatas, George Kokkinakis

University of Patras, Greece

In this paper we present a novel method for Blind Speech Separation of convolutive speech signals of moving speakers in highly reverberant rooms. The separation network used is a hybrid neural network, which performs separation of convolutive speech mixtures in the time domain, without any prior knowledge of the propagation media, based on the Maximum Likelihood Estimation (MLE) principle. The proposed method improves significantly (more than 13% in all adverse mixing situations) the performance of a phoneme-based continuous speech recognition system and therefore can be used as a front-end to separate simultaneous speech of speakers who are moving in reverberant rooms.

Full Paper

Bibliographic reference.  Koutras, Athanasios / Dermatas, Evangelos / Kokkinakis, George (2001): "Blind speech separation of moving speakers using hybrid neural networks", In EUROSPEECH-2001, 997-1000.