14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Monaural Speech Segregation Based on Pitch Track Correction Using an Ensemble Kalman Filter

Han-Gyu Kim (1), Gil-Jin Jang (2), Jeong-Sik Park (3), Yung-Hwan Oh (1)

(1) KAIST, Korea
(2) UNIST, Korea
(3) Mokwon University, Korea

We propose a novel method of pitch track correction that uses an ensemble Kalman filter to improve the performance of monaural speech segregation. The proposed method considers all reliable pitch streaks for pitch track correction, whereas the conventional segregation approach relies on only the longest streak in a given speech stream. In addition, unreliable pitch streaks are corrected with an ensemble Kalman filter that uses autocorrelation functions as noisy observations for the hidden true pitch values. Our proposed approach provides more accurate pitch estimation, thus improving speech segregation performance for various types of noises, in particular, colored noise. In speech segregation experiments on mixtures of speech and various competing noises, the proposed method demonstrated superior performance to the conventional approach.

Full Paper

Bibliographic reference.  Kim, Han-Gyu / Jang, Gil-Jin / Park, Jeong-Sik / Oh, Yung-Hwan (2013): "Monaural speech segregation based on pitch track correction using an ensemble kalman filter", In INTERSPEECH-2013, 813-816.