ISCA Archive SSW 2013
ISCA Archive SSW 2013

Prosodically modifying speech for unit selection speech synthesis databases

Ladan Golipour, Alistair Conkie, Ann Syrdal

This paper investigates the practical limits of artificially increasing the prosodic richness of a unit selection database by transforming the prosodic realization of constituent sentences. The resulting high-quality transformed sentences are added to the database as new material. We examine in detail one of the most challenging prosodic transformations, namely converting statements into yes/no questions. Such transformations can require very large prosodic modifications while at the same time there is a need to retain as much naturalness of the signal as possible. Our data-driven approach relies on learning templates of pitch contours for different stress patterns of interrogative sentences from training data and later on applying these template pitch contours on unseen statements to generate the corresponding questions. We examine experimentally how the modified signals contribute to the perceived synthesis quality of the resulting database when compared with baseline unmodified databases.

Index Terms: speech synthesis, RELP, prosody


Cite as: Golipour, L., Conkie, A., Syrdal, A. (2013) Prosodically modifying speech for unit selection speech synthesis databases. Proc. 8th ISCA Workshop on Speech Synthesis (SSW 8), 255-259

@inproceedings{golipour13_ssw,
  author={Ladan Golipour and Alistair Conkie and Ann Syrdal},
  title={{Prosodically modifying speech for unit selection speech synthesis databases}},
  year=2013,
  booktitle={Proc. 8th ISCA Workshop on Speech Synthesis (SSW 8)},
  pages={255--259}
}