12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings

Sree Harsha Yella, Fabio Valente

Idiap Research Institute, Switzerland

Diarization results can be improved by combining multiple systems. Several combination techniques have been proposed, based on output voting, initialization, and integrated approaches. This paper proposes and investigates a novel approach that combines diarization systems through the use of features. A first diarization system, based on the Information Bottleneck (IB), generates a set of features that contain information relevant to the clustering. These features are then used together with conventional MFCC features in a second diarization system. The method is inspired by the TANDEM framework in ASR. Although fully integrated, the approach requires no modification to either of the two systems in order to integrate the information. Experiments on 24 recordings from the NIST RT06/RT07/RT09 evaluations, collected in five meeting rooms, reveal that when the IB features are used together with MFCC, the total speaker error is reduced from
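As a rough illustration of the feature-level combination described above, the sketch below appends log cluster posteriors from a first system to the MFCC vector of each frame, in the spirit of TANDEM processing in ASR. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `tandem_style_features`, the `floor` parameter, and the toy numbers are hypothetical, and frame-wise concatenation is only one possible way to use the two feature sets together. The abstract does not state how the streams are combined. TANDEM systems usually also decorrelate the log posteriors (for example with PCA) before GMM modeling; that step is omitted here for brevity.

```python
import math


def tandem_style_features(mfcc_frames, ib_posteriors, floor=1e-10):
    """Append log cluster posteriors to each MFCC frame.

    Hypothetical helper for illustration only. Concatenation is an
    assumption; the abstract does not specify the combination scheme.
    """
    if len(mfcc_frames) != len(ib_posteriors):
        raise ValueError("both streams must have the same number of frames")

    combined = []
    for mfcc, post in zip(mfcc_frames, ib_posteriors):
        # Floor each posterior before the log so zeros do not yield -inf.
        log_post = [math.log(max(p, floor)) for p in post]
        combined.append(list(mfcc) + log_post)
    return combined


# Toy data (made-up numbers): 2 frames, 3 MFCC coefficients, 2 clusters.
mfcc = [[1.0, -0.5, 0.2], [0.8, -0.4, 0.1]]
post = [[0.9, 0.1], [0.2, 0.8]]
features = tandem_style_features(mfcc, post)
```

Each output frame has the MFCC dimensionality plus one value per cluster, so a second system that models frames with GMMs could consume it without changes to its own code.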

Bibliographic reference. Yella, Sree Harsha / Valente, Fabio (2011): "Information bottleneck features for HMM/GMM speaker diarization of meetings recordings", in INTERSPEECH-2011, 953-956.