12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

An Efficient Pre-Processing Scheme to Improve the Sound Source Localization System in Noisy Environment

Sheng-Chieh Lee (1), K. Bharanitharan (2), Bo-Wei Chen (1), Jhing-Fa Wang (1), Chung-Hsien Wu (1), Min-Jian Liao (1)

(1) National Cheng Kung University, Taiwan
(2) Korea University, Korea

In this study, we introduce an efficient pre-processing scheme for direction of arrival (DOA) estimation, which is capable of reducing the noise and reverberation effects in speech sound source localization. Furthermore, this presented system is also suitable for far-field speech localization. The adopted method of this proposed system can be simply subdivided into three stages: Linear phase-difference approximation, covariance matrix reconstruction, and frequency bin selection. The first two stages can initially decrease the influences of noise and reverberation; the last stage is used to filter the noise frequency bands according to the eigenvalue decomposition (EVD) of the covariance matrix. The experimental results show that our proposed system has effective performance of detecting different directions of speeches. For different signal-to-noise ratios (SNRs) speech signals, the average estimation errors can be decreased by about 5 to 7.5 degrees.

Full Paper

Bibliographic reference.  Lee, Sheng-Chieh / Bharanitharan, K. / Chen, Bo-Wei / Wang, Jhing-Fa / Wu, Chung-Hsien / Liao, Min-Jian (2011): "An efficient pre-processing scheme to improve the sound source localization system in noisy environment", In INTERSPEECH-2011, 2493-2496.