ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech

Kofi Boakye, Oriol Vinyals, Gerald Friedland

We present an update to our initial work [1] on overlapped speech detection for improving speaker diarization. Specifically, we describe the addition of new features and feature warping techniques that improve segmenter and, consequently, diarization performance. We also demonstrate improved diarization performance by additionally using overlap segment information in a new diarization pre-processing step which excludes overlap segments from speaker clustering. On a subset of the AMI Meeting Corpus we show that this overlap exclusion step nearly triples the relative improvement of diarization error rate as compared to overlap segment post-processing alone.


doi: 10.21437/Interspeech.2008-6

Cite as: Boakye, K., Vinyals, O., Friedland, G. (2008) Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech. Proc. Interspeech 2008, 32-35, doi: 10.21437/Interspeech.2008-6

@inproceedings{boakye08_interspeech,
  author={Kofi Boakye and Oriol Vinyals and Gerald Friedland},
  title={{Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={32--35},
  doi={10.21437/Interspeech.2008-6}
}