11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Automatic Selection of Thresholds for Signal Separation Algorithms Based on Interaural Delay

Chanwoo Kim (1), Richard M. Stern (1), Kiwan Eom (2), Jaewon Lee (2)

(1) Carnegie Mellon University, USA
(2) Samsung Electronics Co. Ltd., Korea

In this paper we describe a system that separates signals by comparing the interaural time delays (ITDs) of their time frequency components to a fixed threshold ITD. While in previous algorithms, the fixed threshold ITD had been obtained empirically from training data in a specific environment, in real environments the characteristics that affect the optimal value of this threshold are unknown and possibly time varying. If these configurations are different from the environment under which ITD threshold had been pre-computed, the performance of the source separation system is degraded. In this paper, we present an algorithm which chooses a threshold ITD that minimizes the cross-correlation of the target and interfering signals, after a compressive nonlinearity. We demonstrate that the algorithm described in this paper provides speech recognition accuracy that is much more robust to changes in environment than would be obtained using a fixed threshold ITD.

Full Paper

Bibliographic reference.  Kim, Chanwoo / Stern, Richard M. / Eom, Kiwan / Lee, Jaewon (2010): "Automatic selection of thresholds for signal separation algorithms based on interaural delay", In INTERSPEECH-2010, 729-732.