9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Two Step Speaker Segmentation Method Using Bayesian Information Criterion and Adapted Gaussian Mixtures Models

Matej Grašič, Marko Kos, Andrej Žgank, Zdravko Kačič

University of Maribor, Slovenia

This paper addresses the topic of online unsupervised speaker segmentation in a complex audio environment as it is present in the Broadcast News databases. A new two stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion with an ABLS-SCD statistical framework where adapted Gaussian mixture models are used to achieve higher accuracy. To enhance the performance of the proposed method a sub-window dependent threshold selection strategy for the ABLS-SCD is introduced. Also an additional window selection strategy for the proposed method is presented. Experimental design and test evaluation were carried out on the Slovenian BNSI Broadcast News database.

Full Paper

Bibliographic reference.  Grašič, Matej / Kos, Marko / Žgank, Andrej / Kačič, Zdravko (2008): "Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models", In INTERSPEECH-2008, 2514-2517.