ISCA Archive Interspeech 2009

Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain

Chanwoo Kim, Kshitiz Kumar, Bhiksha Raj, Richard M. Stern

In this paper, we present a new two-microphone approach that improves speech recognition accuracy when speech is masked by other speech. The algorithm improves on previous systems that have been successful in separating signals based on differences in arrival time of signal components from two microphones. The present algorithm differs from these efforts in that the signal selection takes place in the frequency domain. We observe that additional smoothing of the phase estimates over time and frequency is needed to support adequate speech recognition performance. We demonstrate that the algorithm described in this paper provides better recognition accuracy than time-domain-based signal separation algorithms, and at less than 10 percent of the computation cost.
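
As a rough illustration of frequency-domain signal selection from phase differences, the sketch below builds a binary time-frequency mask from two microphone signals. It is not the authors' implementation: the function names `stft` and `phase_difference_mask`, the Hann window, the frame parameters, and the `max_delay_s` threshold are all assumptions made for this example, and the target is assumed to arrive at both microphones simultaneously.

```python
import numpy as np

def stft(x, n_fft=512, hop=128):
    """Short-time Fourier transform with a Hann window; returns (frames, bins)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def phase_difference_mask(left, right, fs, n_fft=512, hop=128,
                          max_delay_s=5e-5):
    """Keep time-frequency bins whose estimated inter-microphone delay is
    close to zero (hypothetical threshold max_delay_s, in seconds)."""
    X_l = stft(left, n_fft, hop)
    X_r = stft(right, n_fft, hop)
    # Phase difference between the two channels in each bin, wrapped to (-pi, pi].
    phase_diff = np.angle(X_l * np.conj(X_r))
    # Angular frequency of each bin in rad/s.
    omega = 2.0 * np.pi * np.fft.rfftfreq(n_fft, d=1.0 / fs)
    # Convert phase difference to a delay estimate; the DC bin has no
    # usable phase information, so its delay is left at zero.
    delay = np.zeros_like(phase_diff)
    delay[:, 1:] = phase_diff[:, 1:] / omega[1:]
    mask = (np.abs(delay) <= max_delay_s).astype(float)
    return mask, X_l, X_r
```

Multiplying `mask` by the average of `X_l` and `X_r` and inverting the transform would give a separated signal. The sketch omits the smoothing of phase estimates over time and frequency that the abstract reports is needed for adequate recognition performance, and phase wrapping makes the delay estimate ambiguous at high frequencies for widely spaced microphones.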


doi: 10.21437/Interspeech.2009-372

Cite as: Kim, C., Kumar, K., Raj, B., Stern, R.M. (2009) Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain. Proc. Interspeech 2009, 2495-2498, doi: 10.21437/Interspeech.2009-372

@inproceedings{kim09e_interspeech,
  author={Chanwoo Kim and Kshitiz Kumar and Bhiksha Raj and Richard M. Stern},
  title={{Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2495--2498},
  doi={10.21437/Interspeech.2009-372}
}