ISCA Archive Eurospeech 2001

Improving simultaneous speech recognition in real room environments using overdetermined blind source separation

Athanasios Koutras, Evangelos Dermatas, George Kokkinakis

In this paper we present a novel solution to the Overdetermined Blind Speech Separation (OBSS) problem for improving the speech recognition accuracy of N simultaneous speakers in real room environments using M (M>N) microphones. The proposed OBSS system uses basic NxN Blind Speech Separation networks that process, in parallel, all different combinations of the available mixture signals in the frequency domain, resulting in lower computational complexity and faster convergence. Extensive experiments using an array of two to ten microphones and two simultaneous speakers in a simulated real room showed that when the number of microphones increases beyond two, separation performance improves and the phoneme recognition accuracy of an HMM-based decoder increases substantially (by more than 6%). The use of more microphones than speakers is therefore justified as a means of improving speech recognition accuracy in environments with multiple simultaneous speakers.
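To give a sense of how many basic NxN networks such a scheme might run in parallel, the following minimal sketch enumerates the microphone combinations. It assumes that "all different combinations" means every unordered subset of N microphones out of M; the abstract does not spell this out. The function name `mic_subsets` is hypothetical and not taken from the paper.

```python
from itertools import combinations
from math import comb


def mic_subsets(num_mics, num_speakers):
    """Return every unordered choice of num_speakers microphones out of num_mics.

    Assumption (not stated in the abstract): each basic NxN separation
    network receives one such subset of the M mixture signals.
    """
    if num_mics < num_speakers:
        raise ValueError("need at least as many microphones as speakers")
    return list(combinations(range(num_mics), num_speakers))


# Two speakers, as in the reported experiments, with 2, 3 and 10 microphones.
for m in (2, 3, 10):
    subsets = mic_subsets(m, 2)
    # The count equals the binomial coefficient C(M, N).
    assert len(subsets) == comb(m, 2)
    print(m, "microphones ->", len(subsets), "candidate 2x2 networks")
```

Under this assumption the number of parallel networks grows as the binomial coefficient C(M, N), while each individual network stays at the small NxN size.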


doi: 10.21437/Eurospeech.2001-290

Cite as: Koutras, A., Dermatas, E., Kokkinakis, G. (2001) Improving simultaneous speech recognition in real room environments using overdetermined blind source separation. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1009-1012, doi: 10.21437/Eurospeech.2001-290

@inproceedings{koutras01b_eurospeech,
  author={Athanasios Koutras and Evangelos Dermatas and George Kokkinakis},
  title={{Improving simultaneous speech recognition in real room environments using overdetermined blind source separation}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={1009--1012},
  doi={10.21437/Eurospeech.2001-290}
}