ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

A study on pitch pattern generation using HMM-based statistical information

Toshiaki Fukada, Yasuhiro Komori, Takashi Aso, Yasunori Ohora

This paper describes a novel pitch pattern generation method for speech synthesis using Hidden Markov Models (HMMs). In the proposed method, the F0 contours of minor phrase are modeled by HMMs (pitch-HMMs). The pitch-HMMs are trained using F0 and AF0considering phonetic environments (e.g. accent type, mora count, mora position, phonemic category, etc). To evaluate the pitch-HMMs, accent identification experiments are performed. The results indicate that the pitch-HMMs can capture the movement in F0 contours appropriately. In the F0 contour generation experiments, the proposed method yields an averaged root mean square error of 132cent (equivalent to 9.2Hz at 120Hz) between the original and the generated F0 contours. Furthermore, an application of the proposed method to text-to-speech system is also discussed.


Cite as: Fukada, T., Komori, Y., Aso, T., Ohora, Y. (1994) A study on pitch pattern generation using HMM-based statistical information. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 723-726

@inproceedings{fukada94_icslp,
  author={Toshiaki Fukada and Yasuhiro Komori and Takashi Aso and Yasunori Ohora},
  title={{A study on pitch pattern generation using HMM-based statistical information}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={723--726}
}