EUROSPEECH 2001 Scandinavia
This paper describes a stochastic modeling between an F0 contour and linguistic features of a sentence for speech synthesis. The F0 contour of a sentence is represented by concatenation of the F0 patterns of a Japanese syntactic unit, bunsetsu. A bunsetsu F0 pattern is composed of the F0 average and the F0 shape. The most probable sequence of bunsetsu F0 shapes for a sentence are found in the F0 shape database by a probabilistic measure. The probability that an F0 contour is observed for a sentence is defined by two kinds of probabilities, the F0 shape production and the F0 shape bigram. Several typical bunsetsu F0 shapes are extracted by clustering of training data and stored in the F0 shape database. The probability of the F0 shape production is computed for each bunsetsu based on the distribution of linguistic features in the cluster.
Bibliographic reference. Yamashita, Yoichi / Ishida, Tomoyoshi (2001): "Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit", In EUROSPEECH-2001, 533-536.