10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Speech Overlap Detection in a Two-Pass Speaker Diarization System

Marijn Huijbregts (1), David A. van Leeuwen (2), Franciska M. G. de Jong (1)

(1) University of Twente, The Netherlands
(2) TNO Human Factors, The Netherlands

In this paper we present the two-pass speaker diarization system that we developed for the NIST RT09s evaluation. In the first pass, a model for speech overlap detection is generated automatically. This model is then used in two ways to reduce the diarization errors caused by overlapping speech. First, it is used in a second diarization pass to exclude overlapping speech from the data on which the speaker models are trained. Second, it is used to detect speech overlap in the final segmentation, so that overlapping speech segments can be generated. Our experiments show that this overlap detection method improves the performance of all three of our system configurations.
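
The two uses of the overlap model described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the per-frame boolean overlap flags, the per-frame speaker scores, and the rule of adding the second-best-scoring speaker in overlap regions are all assumptions made for illustration only.

```python
from typing import List, Sequence, Set


def frames_for_training(features: Sequence, overlap: Sequence[bool]) -> List:
    """Use 1 (hypothetical): drop frames flagged as overlap before
    the speaker models are trained in the second pass."""
    return [f for f, is_overlap in zip(features, overlap) if not is_overlap]


def label_frames(scores: Sequence[Sequence[float]],
                 overlap: Sequence[bool]) -> List[Set[int]]:
    """Use 2 (hypothetical): build the final per-frame labelling.

    Every frame gets its best-scoring speaker. Frames flagged as overlap
    also get the second-best speaker; this selection rule is an assumption,
    not something stated in the abstract.
    """
    labels = []
    for frame_scores, is_overlap in zip(scores, overlap):
        # Speaker indices ordered from highest to lowest score.
        ranked = sorted(range(len(frame_scores)),
                        key=lambda s: frame_scores[s], reverse=True)
        n_active = 2 if is_overlap and len(ranked) > 1 else 1
        labels.append(set(ranked[:n_active]))
    return labels


# Toy data: three frames, two speakers, overlap detected in the middle frame.
overlap_flags = [False, True, False]
clean = frames_for_training(["a", "b", "c"], overlap_flags)
active = label_frames([[0.9, 0.1], [0.6, 0.5], [0.2, 0.8]], overlap_flags)
```

In this toy example the middle frame is left out of the training data and is assigned to both speakers in the final labelling, while the other two frames keep a single speaker each.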

Bibliographic reference. Huijbregts, Marijn / Leeuwen, David A. van / Jong, Franciska M. G. de (2009): "Speech overlap detection in a two-pass speaker diarization system", in INTERSPEECH-2009, 1063-1066.