ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Robust speaker change detection using Kernel-Gaussian model

Jie Gao, Xiang Zhang, Qingwei Zhao, Yonghong Yan

This paper introduces and evaluates a novel approach for unsupervised speaker change detection. In many unsupervised speaker change detection algorithms, each audio segment is typically modeled with a multivariate single Gaussian density, where it is assumed that the distribution of the speech features of the segment is Gaussian. However, this assumption is too strong in many cases. Therefore, this paper presents an alternative to the single Gaussian model: Gaussian model in reproducing kernel Hilbert space (RKHS) or Kernel-Gaussian model (KGM). KGM first projects speech features into RKHS via a nonlinear mapping. Then it models the features in RKHS with a Gaussian density. The mapping procedure enables KGM to capture nonlinear structure of speech features. An implementation of KGM is proposed and evaluated. Experiments on different datasets show that better results are achieved by KGM compared to the single Gaussian model.


doi: 10.21437/Interspeech.2008-618

Cite as: Gao, J., Zhang, X., Zhao, Q., Yan, Y. (2008) Robust speaker change detection using Kernel-Gaussian model. Proc. Interspeech 2008, 2494-2497, doi: 10.21437/Interspeech.2008-618

@inproceedings{gao08b_interspeech,
  author={Jie Gao and Xiang Zhang and Qingwei Zhao and Yonghong Yan},
  title={{Robust speaker change detection using Kernel-Gaussian model}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2494--2497},
  doi={10.21437/Interspeech.2008-618}
}