ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Learning the parameters of quantitative prosody models

Oliver Jokisch, Hansjörg Mixdorff, Hans Kruschke, Ulrich Kordon

The article introduces a novel hybrid data driven and rule based approach for the prosody control in a TTS system, which combines the advantages of well-balanced, quantitative models with the flexible training of derived model parameters. Instancing the training of Fujisaki intonation parameters for German (MFGI) the article describes the hybrid data driven and rule based architecture HYDRA, the speech database, the extraction of the model parameters and the neural network (NN) training of these parameters. Preliminary results using the hybrid intonation model are presented. A hybrid neural network and rule based, quantitative model can be easily parameterized and adapted e.g. for multilingual applications, but has a higher complexity and requires the automatic extraction of the model parameters from a speech database.


Cite as: Jokisch, O., Mixdorff, H., Kruschke, H., Kordon, U. (2000) Learning the parameters of quantitative prosody models. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 645-648

@inproceedings{jokisch00_icslp,
  author={Oliver Jokisch and Hansjörg Mixdorff and Hans Kruschke and Ulrich Kordon},
  title={{Learning the parameters of quantitative prosody models}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 645-648}
}