ISCA Archive Eurospeech 2001

Automatic prosody generation - a model for Hungarian

Gábor Olaszy, Géza Németh, Péter Olaszi

Our model describes a complex set of functions for the three prosody components of read speech: duration, fundamental frequency (F0), and intensity. Each component is modelled separately by a three-step procedure. For duration, a new method was developed, based on the indirect determination of specific sound durations; final duration values are calculated from the specific durations in two further steps. F0 changes are likewise modelled on three levels: rules at the sentence level, followed by the word and syllable level, and completed by the micro-intonation level. Another three-level model handles the intensity structure, with rules applied to sounds, to words, and to the complete sentence. The three component models influence one another during prosody generation, and the cross effects among them are also discussed. The model can be applied in speech research and in applications (synthesis and recognition). It was tested for Hungarian.

Keywords: prosody generation, three-level model, specific sound durations, word-level duration map

doi: 10.21437/Eurospeech.2001-141

Cite as: Olaszy, G., Németh, G., Olaszi, P. (2001) Automatic prosody generation - a model for hungarian. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 525-528, doi: 10.21437/Eurospeech.2001-141

  author={Gábor Olaszy and Géza Németh and Péter Olaszi},
  title={{Automatic prosody generation - a model for hungarian}},
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},