We have previously proposed a generative model of speech F0 contours, based on the discrete-time version of the Fujisaki model (a model of the mechanism for controlling F0s through laryngeal muscles). One advantage of this model is that it allows us to apply statistical methods to estimate the Fujisaki-model parameters from speech F0 contours. This paper proposes a new generative model of speech F0 contours incorporating a vocabulary model of intonation patterns. A parameter inference algorithm for the present model is derived. We quantitatively evaluated the performance of our parameter inference algorithm.
Bibliographic reference. Ishihara, Tatsuma / Kameoka, Hirokazu / Yoshizato, Kota / Saito, Daisuke / Sagayama, Shigeki (2013): "Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence", In INTERSPEECH-2013, 1017-1021.