ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Monaural speech segregation based on pitch track correction using an ensemble kalman filter

Han-Gyu Kim, Gil-Jin Jang, Jeong-Sik Park, Yung-Hwan Oh

We propose a novel method of pitch track correction that uses an ensemble Kalman filter to improve the performance of monaural speech segregation. The proposed method considers all reliable pitch streaks for pitch track correction, whereas the conventional segregation approach relies on only the longest streak in a given speech stream. In addition, unreliable pitch streaks are corrected with an ensemble Kalman filter that uses autocorrelation functions as noisy observations for the hidden true pitch values. Our proposed approach provides more accurate pitch estimation, thus improving speech segregation performance for various types of noises, in particular, colored noise. In speech segregation experiments on mixtures of speech and various competing noises, the proposed method demonstrated superior performance to the conventional approach.


doi: 10.21437/Interspeech.2013-233

Cite as: Kim, H.-G., Jang, G.-J., Park, J.-S., Oh, Y.-H. (2013) Monaural speech segregation based on pitch track correction using an ensemble kalman filter. Proc. Interspeech 2013, 813-816, doi: 10.21437/Interspeech.2013-233

@inproceedings{kim13b_interspeech,
  author={Han-Gyu Kim and Gil-Jin Jang and Jeong-Sik Park and Yung-Hwan Oh},
  title={{Monaural speech segregation based on pitch track correction using an ensemble kalman filter}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={813--816},
  doi={10.21437/Interspeech.2013-233}
}