ISCA Archive CHiME 2018
ISCA Archive CHiME 2018

Front-end processing for the CHiME-5 dinner party scenario

Christoph Boeddecker, Jens Heitkaemper, Joerg Schmalenstroeer, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach

This contribution presents a speech enhancement system for the CHiME-5 Dinner Party Scenario. The front-end employs multi-channel linear time-variant filtering and achieves its gains without the use of a neural network. We present an adaptation of blind source separation techniques to the CHiME-5 database which we call Guided Source Separation (GSS). Using the baseline acoustic and language model, the combination of Weighted Prediction Error based dereverberation, guided source separation, and beamforming reduces the WER by 10.54% (relative) for the single array track and by 21.12 % (relative) on the multiple array track.


doi: 10.21437/CHiME.2018-8

Cite as: Boeddecker, C., Heitkaemper, J., Schmalenstroeer, J., Drude, L., Heymann, J., Haeb-Umbach, R. (2018) Front-end processing for the CHiME-5 dinner party scenario. Proc. 5th International Workshop on Speech Processing in Everyday Environments (CHiME 2018), 35-40, doi: 10.21437/CHiME.2018-8

@inproceedings{boeddecker18_chime,
  author={Christoph Boeddecker and Jens Heitkaemper and Joerg Schmalenstroeer and Lukas Drude and Jahn Heymann and Reinhold Haeb-Umbach},
  title={{Front-end processing for the CHiME-5 dinner party scenario}},
  year=2018,
  booktitle={Proc. 5th International Workshop on Speech Processing in Everyday Environments (CHiME 2018)},
  pages={35--40},
  doi={10.21437/CHiME.2018-8}
}