ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

A pitch-based model for separation of reverberant speech

Nicoleta Roman, DeLiang Wang

In everyday listening, both background noise and reverberation degrade the speech signal. While monaural speech separation based on periodicity has achieved considerable progress in handling additive noise, little research has been devoted to reverberant scenarios. Reverberation smears the harmonic structure of speech signals, and our evaluations using a pitch-based separation algorithm show that an increase in the room reverberation time causes degradation in performance due to the loss in periodicity for the target signal. We propose a two-stage monaural speech separation system that combines the inverse filtering of the room impulse response corresponding to target location with a pitch-based speech segregation method. As a result of the first processing stage, the harmonicity of a signal arriving from target direction is partially restored while signals arriving from other locations are further smeared, and this leads to improved separation. A systematic evaluation shows that the proposed system results in considerable signal-to-noise ratio gains across different conditions.

doi: 10.21437/Interspeech.2005-687

Cite as: Roman, N., Wang, D. (2005) A pitch-based model for separation of reverberant speech. Proc. Interspeech 2005, 2109-2112, doi: 10.21437/Interspeech.2005-687

  author={Nicoleta Roman and DeLiang Wang},
  title={{A pitch-based model for separation of reverberant speech}},
  booktitle={Proc. Interspeech 2005},