ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Data driven intonation modelling of 6 languages

Jeska Buhmann, Halewijn Vereecken, Justin Fackrell, Jean-Pierre Martens, Bert van Coile

A method for creating multi-lingual intonation models is described. The method adheres closely to the pioneering work of Traber, in that a recurrent neural network (RNN) predicts a number of F0 values per syllable. An important aspect of the work presented here is the selection of linguistic and prosodic features that are suitable for predicting the observed intonation phenomena in different languages. Another aspect is the use of automatic labelling techniques for the preparation of the training data. Experiments on six languages demonstrate that even though there are differences in performance across languages, it is possible to obtain good results for all six languages. More importantly, making use of automatic labelling techniques for the construction of the training corpora, tends to give better results than making use of manual labelling techniques.


Cite as: Buhmann, J., Vereecken, H., Fackrell, J., Martens, J.-P., Coile, B.v. (2000) Data driven intonation modelling of 6 languages. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 179-182

@inproceedings{buhmann00_icslp,
  author={Jeska Buhmann and Halewijn Vereecken and Justin Fackrell and Jean-Pierre Martens and Bert van Coile},
  title={{Data driven intonation modelling of 6 languages}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 179-182}
}