ISCA Archive Interspeech 2009

Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling

Kyu J. Han, Shrikanth S. Narayanan

In this work we describe two novel improvements to our previously proposed speaker diarization system for meeting speech. The first is recurrent selection of representative speech segments for speaker clustering; the second is participant interaction pattern modeling. The former selects speech segments that are highly relevant to speaker clustering, especially from a robust cluster modeling perspective, and keeps updating them throughout the clustering procedure. The latter statistically models conversation patterns between meeting participants and applies them as a priori information when refining diarization results. Experimental results show that the two proposed approaches reduce the diarization error rate by 29.82% (relative) in tests on 13 meeting excerpts from various meeting speech corpora.
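
The abstract does not say which statistical model is used for participant interaction patterns. As a rough illustration only, the sketch below assumes one simple possibility: a first-order model of turn-taking, where the probability of the next speaker given the current speaker is estimated from a sequence of speaker turns with additive smoothing. The function name `turn_transition_probs`, the `smoothing` parameter, and the toy turn sequence are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def turn_transition_probs(speaker_turns, smoothing=1.0):
    """Estimate P(next speaker | current speaker) from a list of speaker turns.

    Assumption (not from the paper): interaction patterns are approximated
    by first-order turn transitions with additive smoothing.
    """
    speakers = sorted(set(speaker_turns))

    # Count how often each speaker is followed by each other speaker.
    counts = defaultdict(Counter)
    for current, following in zip(speaker_turns, speaker_turns[1:]):
        counts[current][following] += 1

    # Normalize the smoothed counts so each row sums to 1.
    probs = {}
    for current in speakers:
        total = sum(counts[current].values()) + smoothing * len(speakers)
        probs[current] = {
            following: (counts[current][following] + smoothing) / total
            for following in speakers
        }
    return probs


# Toy turn sequence for three hypothetical participants.
turns = ["A", "B", "A", "C", "A", "B"]
priors = turn_transition_probs(turns)
```

Probabilities of this kind could in principle serve as a priori information when re-scoring an initial diarization output, but how the paper's system actually applies its interaction model is described in the paper itself, not here.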


doi: 10.21437/Interspeech.2009-327

Cite as: Han, K.J., Narayanan, S.S. (2009) Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling. Proc. Interspeech 2009, 1067-1070, doi: 10.21437/Interspeech.2009-327

@inproceedings{han09_interspeech,
  author={Kyu J. Han and Shrikanth S. Narayanan},
  title={{Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1067--1070},
  doi={10.21437/Interspeech.2009-327}
}