ISCA Archive Odyssey 2004

Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model

Jia-Hsin Hsieh, Chung-Hsien Wu

This paper proposes an approach to unsupervised speaker segmentation and gender discrimination of broadcast news. A speaker segmentation mechanism using an MDL-based Gaussian model is first applied to detect speaker changes from the mean and covariance of the Gaussian model. The segments delimited by these speaker changes are then smoothed and classified as male or female. Experimental results on broadcast news show that the proposed method outperforms the Delta-BIC method for speaker segmentation, with a 9.2% missed detection rate and a 7.5% false alarm rate. In addition, segment-based gender discrimination improves accuracy by 9% over the clip-based discriminator.
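
The abstract does not state the MDL criterion itself, so the following is only a minimal sketch of the Delta-BIC baseline it is compared against, which is a closely related penalized-likelihood test. It models a window of feature frames either as one Gaussian or as two Gaussians split at frame `i`, and reports a speaker change when the two-model fit wins after a model-size penalty. The diagonal covariance, the function names (`diag_logdet`, `delta_bic`, `find_change`, `make_frames`), and the parameters `margin` and `penalty_weight` are assumptions made for illustration and are not taken from the paper.

```python
import math
import random


def diag_logdet(frames):
    """Log-determinant of the ML diagonal covariance of a list of feature vectors."""
    n = len(frames)
    dim = len(frames[0])
    total = 0.0
    for k in range(dim):
        mean = sum(f[k] for f in frames) / n
        var = sum((f[k] - mean) ** 2 for f in frames) / n
        total += math.log(max(var, 1e-10))  # variance floor avoids log(0)
    return total


def delta_bic(frames, i, penalty_weight=1.0):
    """Delta-BIC for splitting `frames` at index i (positive favours a change)."""
    n = len(frames)
    dim = len(frames[0])
    # Log-likelihood gain of two Gaussians over a single Gaussian.
    gain = 0.5 * (
        n * diag_logdet(frames)
        - i * diag_logdet(frames[:i])
        - (n - i) * diag_logdet(frames[i:])
    )
    # A diagonal Gaussian has 2 * dim parameters (means and variances);
    # the split adds one such model (assumed parameterization).
    penalty = 0.5 * penalty_weight * (2 * dim) * math.log(n)
    return gain - penalty


def find_change(frames, margin=10, penalty_weight=1.0):
    """Return the split index with the largest positive Delta-BIC, or None."""
    best_i, best_score = None, 0.0
    for i in range(margin, len(frames) - margin):
        score = delta_bic(frames, i, penalty_weight)
        if score > best_score:
            best_i, best_score = i, score
    return best_i


def make_frames(n, mean, dim, rng):
    """Hypothetical helper: n synthetic frames drawn from N(mean, 1) per dimension."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n)]
```

On synthetic data, for instance `make_frames(100, 0.0, 2, rng) + make_frames(100, 5.0, 2, rng)` with `rng = random.Random(0)`, `find_change` should return an index close to 100. Real systems would run this over sliding windows of cepstral features, and the paper's MDL-based variant may differ in both the penalty term and the search strategy.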


Cite as: Hsieh, J.-H., Wu, C.-H. (2004) Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model. Proc. The Speaker and Language Recognition Workshop (Odyssey 2004), 345-348

@inproceedings{hsieh04_odyssey,
  author={Jia-Hsin Hsieh and Chung-Hsien Wu},
  title={{Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model}},
  year=2004,
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2004)},
  pages={345--348}
}