ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Sound segregation based on binaural zero-crossings

Young-Ik Kim, Sung Jun An, Rhee Man Kil, Hyung-Min Park

This paper presents a new method of sound segregation based on zero-crossings generated from binaural filter-bank outputs. In our approach, sound source directions are identified using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). The estimation of ITDs is performed using zero-crossings generated from binaural filter-bank outputs to get more reliable ITD samples in noisy environments. We also consider the estimation of ITDs with the aid of IID samples to cope with the phase ambiguities of ITD samples in high frequencies. As a result, the proposed method is able to provide an accurate estimate of sound source directions which gives us a good masking scheme for sound segregation while offering significantly less computational complexity compared to cross-correlation based methods.

doi: 10.21437/Interspeech.2005-742

Cite as: Kim, Y.-I., An, S.J., Kil, R.M., Park, H.-M. (2005) Sound segregation based on binaural zero-crossings. Proc. Interspeech 2005, 2325-2328, doi: 10.21437/Interspeech.2005-742

  author={Young-Ik Kim and Sung Jun An and Rhee Man Kil and Hyung-Min Park},
  title={{Sound segregation based on binaural zero-crossings}},
  booktitle={Proc. Interspeech 2005},