Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech

Kota Yoshizato (1), Hirokazu Kameoka (1,2), Daisuke Saito (1), Shigeki Sagayama (1)

(1) Graduate School of Information Science and Technology, The University of Tokyo, Japan
(2) NTT Communication Science Laboratories, NTT Corporation, Japan

This paper proposes a statistical model of speech F0 contours based on a discrete-time version of the Fujisaki model. Our motivation for formulating this model is to incorporate F0 contours into various statistical speech processing problems. We describe the formulation of the model and quantitatively evaluate its performance through Fujisaki-model parameter estimation from real speech F0 contours. Compared with another speech F0 model we have previously proposed, the present model places more emphasis on fitting the observed F0 contours: the previous model is based on a squared-error criterion in the domain of the Fujisaki-model commands, whereas the criterion of the present model is defined in the domain of the F0 contours.

Index Terms: speech F0 contours, statistical model, Fujisaki model, hidden Markov model, EM algorithm

Bibliographic reference.  Yoshizato, Kota / Kameoka, Hirokazu / Saito, Daisuke / Sagayama, Shigeki (2012): "Hidden Markov convolutive mixture model for pitch contour analysis of speech", In INTERSPEECH-2012, 390-393.