ISCA Archive Interspeech 2008

Including pitch accent optionality in unit selection text-to-speech synthesis

Leonardo Badino, Robert A. J. Clark, Volker Strom

Significant variability in pitch accent placement is found when comparing the patterns of prosodic prominence realized by different English speakers reading the same sentences. In this paper we describe a simple approach that incorporates this variability into the synthesis of prosodic prominence in unit selection text-to-speech synthesis.

The main motivation for our approach is that, by taking into account the variability of accent placement, we enlarge the set of prosodically acceptable speech units, thus increasing the chances of selecting a good-quality sequence of units in both prosodic and segmental terms.

Results of a large-scale perceptual test show the benefits of our approach and indicate directions for further improvement.


doi: 10.21437/Interspeech.2008-549

Cite as: Badino, L., Clark, R.A.J., Strom, V. (2008) Including pitch accent optionality in unit selection text-to-speech synthesis. Proc. Interspeech 2008, 2118-2121, doi: 10.21437/Interspeech.2008-549

@inproceedings{badino08_interspeech,
  author={Leonardo Badino and Robert A. J. Clark and Volker Strom},
  title={{Including pitch accent optionality in unit selection text-to-speech synthesis}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2118--2121},
  doi={10.21437/Interspeech.2008-549}
}