ISCA Archive Eurospeech 1999

Using decision trees within the tilt intonation model to predict F0 contours

Kurt E. Dusterhoff, Alan W. Black, Paul Taylor

This paper presents an intonation generation system for use in a text-to-speech synthesis system. The intonation generation system uses classification trees to predict intonation event location and regression trees to predict parameters relating to the F0 shape for the predicted events. The decision trees model intonation within the Tilt intonation model, which provides a parameterized description of fundamental frequency and an intuitive labelling scheme. The event location trees predict an event class (e.g. accent, boundary, none) for each syllable in an utterance based on local and global context (e.g. stress, phrasing, part of speech). The parameter prediction trees then provide the parameterized description of each intonation event based on similar context features. Informal results of the full system are presented together with results for the individual components.
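
The two-stage structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the trees below are hand-written toy stand-ins for trees that would be trained on labelled speech data, and the feature names (`stressed`, `phrase_final`, `pos`), the parameter names (`amplitude_hz`, `duration_s`, `tilt`), and all leaf values are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Syllable:
    # Hypothetical context features; the paper's actual feature set may differ.
    stressed: bool
    phrase_final: bool
    pos: str  # part of speech of the word containing the syllable


def walk(tree, syl):
    """Descend a nested (test, yes_branch, no_branch) tuple until a leaf is reached."""
    while isinstance(tree, tuple):
        test, yes_branch, no_branch = tree
        tree = yes_branch if test(syl) else no_branch
    return tree


# Stage 1: toy classification tree that assigns an event class to each syllable.
EVENT_TREE = (
    lambda s: s.phrase_final,
    "boundary",
    (lambda s: s.stressed and s.pos in {"noun", "verb", "adj"}, "accent", "none"),
)

# Stage 2: toy regression trees, one per event parameter (all values invented).
PARAM_TREES = {
    "amplitude_hz": (lambda s: s.stressed, 30.0, 10.0),
    "duration_s": (lambda s: s.stressed, 0.25, 0.15),
    "tilt": (lambda s: s.phrase_final, -0.5, 0.2),
}


def predict(utterance):
    """Return (syllable_index, event_class, parameters) for each predicted event."""
    events = []
    for i, syl in enumerate(utterance):
        label = walk(EVENT_TREE, syl)
        if label == "none":
            continue  # no intonation event on this syllable
        params = {name: walk(tree, syl) for name, tree in PARAM_TREES.items()}
        events.append((i, label, params))
    return events


utterance = [
    Syllable(stressed=False, phrase_final=False, pos="det"),
    Syllable(stressed=True, phrase_final=False, pos="noun"),
    Syllable(stressed=False, phrase_final=True, pos="verb"),
]

for index, label, params in predict(utterance):
    print(index, label, params)
```

In a real system both stages would be learned from a corpus, and the predicted parameters would then be converted into an F0 contour; that conversion step is omitted here.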


doi: 10.21437/Eurospeech.1999-369

Cite as: Dusterhoff, K.E., Black, A.W., Taylor, P. (1999) Using decision trees within the tilt intonation model to predict F0 contours. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1627-1630, doi: 10.21437/Eurospeech.1999-369

@inproceedings{dusterhoff99_eurospeech,
  author={Kurt E. Dusterhoff and Alan W. Black and Paul Taylor},
  title={{Using decision trees within the tilt intonation model to predict F0 contours}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1627--1630},
  doi={10.21437/Eurospeech.1999-369}
}