ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format

Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang

To improve the performance of the HMM-based voice conversion system in which the LSP coefficient is introduced as the spectral representation, a model clustering technique to tie HMMs into classes for the model adaptation, considering the phonetic and linguistic contextual factors of HMMs, is adopted in this paper. Besides, due to the relationship between the LSP coefficients of adjacent orders, an appropriate format of the regression matrix is suggested according to the small amount of the adaptation training data. Subjective and objective tests prove that the source HMMs can be adapted more accurately using the proposed method, meanwhile the synthetic speech generated from the adapted model has better discrimination and speech quality.


doi: 10.21437/Interspeech.2006-578

Cite as: Qin, L., Wu, Y.-J., Ling, Z.-H., Wang, R.-H. (2006) Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format. Proc. Interspeech 2006, paper 1105-Thu1BuP.1, doi: 10.21437/Interspeech.2006-578

@inproceedings{qin06_interspeech,
  author={Long Qin and Yi-Jian Wu and Zhen-Hua Ling and Ren-Hua Wang},
  title={{Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1105-Thu1BuP.1},
  doi={10.21437/Interspeech.2006-578}
}