ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Optimal model order selection based on regression tree in speaker identification

Shilei Zhang, Junmei Bai, Shuwu Zhang, Bo Xu

In this paper we propose a new method to select the optimal model order for the initialization of Gaussian Mixture speaker Models (GMM) based on regression tree in text-independent speaker identification system. The objective is to choose the optimal number of components which is necessary to adequately model a speaker for a good speaker identification performance according to the Bayesian Information Criterion (BIC) and agglomerative clustering. One obvious advantage of such method is that it provides a flexible framework to select an optimal speaker model order based on the training data for each client speaker. The experimental results on the YOHO corpus show that adaptive model mixture components achieves better performance, especially considering the fact that different speakers have different amounts of available enrollment data.


doi: 10.21437/Interspeech.2005-639

Cite as: Zhang, S., Bai, J., Zhang, S., Xu, B. (2005) Optimal model order selection based on regression tree in speaker identification. Proc. Interspeech 2005, 2045-2048, doi: 10.21437/Interspeech.2005-639

@inproceedings{zhang05f_interspeech,
  author={Shilei Zhang and Junmei Bai and Shuwu Zhang and Bo Xu},
  title={{Optimal model order selection based on regression tree in speaker identification}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2045--2048},
  doi={10.21437/Interspeech.2005-639}
}