INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Sound Source Localization Based on Zero-Crosing Peak-Amplitude Coding

Young-Ik Kim, Rhee Man Kil

KAIST, Korea

This paper presents a new method of sound source localization based on zero-crossings generated from binaural filter-bank outputs. To detect the sound source direction, in the conventional methods the inter-aural time differences (ITDs) and the inter-aural intensity differences (IIDs) are estimated using the cross-correlations of neuronal firing rates. However, these methods require high computational complexity involved in the computation of cross-correlations and they suffer from inaccuracies in estimating the ITDs and IIDs, especially in a noisy multi-source environment. In this context, we propose a method using zero-crossing and peak-amplitudes (ZCPAs) of binaural filter-bank outputs for accurate and efficient estimation of ITDs and IIDs. The proposed method is able to provide accurate estimation of the sound source direction and robustness to noise while offering significantly less computational complexity compared to cross-correlation based methods.

Full Paper

Bibliographic reference.  Kim, Young-Ik / Kil, Rhee Man (2004): "Sound source localization based on zero-crosing peak-amplitude coding", In INTERSPEECH-2004, 477-480.