ISCA Archive Interspeech 2021
ISCA Archive Interspeech 2021

Speaker Verification-Based Evaluation of Single-Channel Speech Separation

Matthew Maciejewski, Shinji Watanabe, Sanjeev Khudanpur

Speech enhancement techniques typically focus on intrinsic metrics of signal quality. The overwhelming majority of deep learning-based single-channel speech separation studies, for instance, have relied on a single class of metrics to evaluate the systems by. These metrics, usually variants of Signal-to-Distortion Ratio (SDR), measure fidelity to the “ground truth” waveform. This can be problematic, not only for lack of diversity in evaluation metrics, but also in cases where a perfect ground truth waveform may be unavailable. In this work, we explore the value of speaker verification as an extrinsic metric of separation quality, with additional utility as evidence of the benefits of separation as pre-processing for downstream tasks.


doi: 10.21437/Interspeech.2021-1924

Cite as: Maciejewski, M., Watanabe, S., Khudanpur, S. (2021) Speaker Verification-Based Evaluation of Single-Channel Speech Separation. Proc. Interspeech 2021, 3520-3524, doi: 10.21437/Interspeech.2021-1924

@inproceedings{maciejewski21_interspeech,
  author={Matthew Maciejewski and Shinji Watanabe and Sanjeev Khudanpur},
  title={{Speaker Verification-Based Evaluation of Single-Channel Speech Separation}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={3520--3524},
  doi={10.21437/Interspeech.2021-1924}
}