ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis

Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi

This paper describes a style adaptation technique using hidden semi-Markov model (HSMM) based maximum likelihood linear regression (MLLR). The HSMM-based MLLR technique can estimate regression matrices for affine transform of mean vectors of output and state duration distributions which maximize likelihood of adaptation data using EM algorithm. In this study, we apply this adaptation technique to style adaptation in HSMM-based speech synthesis. From the results of several subjective tests, we show that the HSMM-based MLLR technique can perform style adaptation with maintaining naturalness of the synthetic speech compared with the conventional HMM-based MLLR technique.


doi: 10.21437/Interspeech.2005-618

Cite as: Tachibana, M., Yamagishi, J., Masuko, T., Kobayashi, T. (2005) Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis. Proc. Interspeech 2005, 2805-2808, doi: 10.21437/Interspeech.2005-618

@inproceedings{tachibana05_interspeech,
  author={Makoto Tachibana and Junichi Yamagishi and Takashi Masuko and Takao Kobayashi},
  title={{Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2805--2808},
  doi={10.21437/Interspeech.2005-618}
}