Many aspects of prosody prediction in speech synthesis could be improved, from placement of symbolic accent and phrase boundary markers to control of continuously varying parameters (e.g., duration, fundamental frequency). The goal of this work is to develop algorithms for predicting aspects of fundamental frequency typically said to have gradient variation: pitch range and prominence. In addition, the results of the automatic training methodology are used to investigate differences in prominence patterns associated with different genres of speech.
Cite as: Bulyko, I., Ostendorf, M. (1999) Predicting gradient F0 variation: pitch range and accent prominence. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1819-1822, doi: 10.21437/Eurospeech.1999-396
@inproceedings{bulyko99_eurospeech, author={Ivan Bulyko and Mari Ostendorf}, title={{Predicting gradient F0 variation: pitch range and accent prominence}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={1819--1822}, doi={10.21437/Eurospeech.1999-396} }