ISCA Archive Interspeech 2013

Generating fundamental frequency contours for speech synthesis in Yorùbá

Daniel R. van Niekerk, Etienne Barnard

We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in text-to-speech (TTS) synthesis of Yorùbá (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross-validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for improving the synthesis of tone in African languages, which has been a challenge to date.


doi: 10.21437/Interspeech.2013-112

Cite as: van Niekerk, D.R., Barnard, E. (2013) Generating fundamental frequency contours for speech synthesis in Yorùbá. Proc. Interspeech 2013, 1027-1031, doi: 10.21437/Interspeech.2013-112

@inproceedings{niekerk13_interspeech,
  author={Daniel R. van Niekerk and Etienne Barnard},
  title={{Generating fundamental frequency contours for speech synthesis in Yorùbá}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1027--1031},
  doi={10.21437/Interspeech.2013-112}
}