ISCA Archive Interspeech 2013

Interference robust DOA estimation of human speech by exploiting historical information and temporal correlation

Wei Xue, Shan Liang, Wenju Liu

Although various direction-of-arrival (DOA) estimation methods for human speech have been presented, most of them assume that the noise received by different microphones is non-directional. In practical scenarios, however, strong directional interferences often exist as well, and the performance of existing methods degrades seriously in such cases. In this paper, we present a new interference-robust DOA estimation method for human speech that exploits historical information and temporal correlation. First, utilizing the historical DOA estimates, we perform "post-beamforming" in the last frame to suppress the directional interferences. Then, exploiting the temporal correlation of speech spectra, we calculate frequency weights that emphasize speech-dominated frequency bins, based on the estimated a priori SNR of the enhanced signal. Finally, we propose a new DOA cost function that uses a frequency-weighted spatial correlation matrix to estimate the DOA of the speech source. Experimental results show that the proposed method outperforms existing algorithms in reverberant environments with additive white Gaussian noise in the presence of different kinds of interference.
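
The abstract does not give the cost function itself, so the following is only a minimal sketch of the general idea of frequency-weighted DOA estimation, not the authors' method. It assumes a far-field source, a uniform linear array, and a single STFT frame, and it uses a plain steered-response-power cost, sum over bins of w(f) * a^H R(f) a with the rank-one estimate R(f) = x(f) x(f)^H. The function names, the array geometry, the toy a priori SNR values `xi`, and the Wiener-like mapping w = xi / (1 + xi) are all illustrative assumptions; the post-beamforming step is not modeled.

```python
import numpy as np


def simulate_frame(freqs, mic_x, angle_deg, spectrum, c=343.0):
    """One far-field STFT frame on a linear array; returns (n_mics, n_bins)."""
    delays = mic_x * np.cos(np.deg2rad(angle_deg)) / c
    return np.exp(-2j * np.pi * np.outer(delays, freqs)) * spectrum[None, :]


def weighted_srp_doa(X, freqs, mic_x, weights, angles_deg, c=343.0):
    """Grid search over a frequency-weighted steered-response-power cost.

    X: (n_mics, n_bins) complex frame, weights: (n_bins,) non-negative.
    Per bin, |a^H x|^2 equals a^H R a with R = x x^H (rank-one estimate).
    """
    angles = np.deg2rad(angles_deg)
    delays = np.outer(np.cos(angles), mic_x) / c  # (n_angles, n_mics)
    # Steering vectors for every candidate angle and bin: (n_angles, n_mics, n_bins)
    steer = np.exp(-2j * np.pi * delays[:, :, None] * freqs[None, None, :])
    beam = np.einsum("amf,mf->af", steer.conj(), X)  # a^H x per angle and bin
    cost = (np.abs(beam) ** 2) @ weights  # weighted sum over bins
    return float(angles_deg[np.argmax(cost)]), cost


# Toy scene (all values hypothetical): speech at 60 degrees in the low bins,
# a stronger directional interferer at 120 degrees in the high bins.
freqs = np.arange(300.0, 3000.0, 50.0)
mic_x = 0.05 * np.arange(4)  # 4 microphones, 5 cm spacing
angles_deg = np.arange(0.0, 181.0, 1.0)
speech_bins = freqs < 1500.0

speech = simulate_frame(freqs, mic_x, 60.0, speech_bins.astype(float))
interf = simulate_frame(freqs, mic_x, 120.0, 5.0 * (~speech_bins))
X = speech + interf

# Assumed a priori SNR per bin and an assumed Wiener-like weight mapping.
xi = np.where(speech_bins, 100.0, 1e-4)
weights = xi / (1.0 + xi)

est_weighted, _ = weighted_srp_doa(X, freqs, mic_x, weights, angles_deg)
est_uniform, _ = weighted_srp_doa(X, freqs, mic_x, np.ones_like(freqs), angles_deg)
print(est_weighted, est_uniform)
```

In this toy setup the weighted estimate should land near the speech direction, while uniform weights let the stronger interferer dominate the cost. In practice the a priori SNR would have to be estimated from the signal (for example with a decision-directed estimator), which this sketch replaces with fixed values.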


doi: 10.21437/Interspeech.2013-647

Cite as: Xue, W., Liang, S., Liu, W. (2013) Interference robust DOA estimation of human speech by exploiting historical information and temporal correlation. Proc. Interspeech 2013, 2895-2899, doi: 10.21437/Interspeech.2013-647

@inproceedings{xue13b_interspeech,
  author={Wei Xue and Shan Liang and Wenju Liu},
  title={{Interference robust DOA estimation of human speech by exploiting historical information and temporal correlation}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2895--2899},
  doi={10.21437/Interspeech.2013-647}
}