Rhythmic patterns and literary genres in synthesized speech

Elisabeth Delais-Roussarie, Damien Lolive, Hiyon Yoo, David Guennec


In this paper, the rhythmic patterns observed in natural and synthesized speech are compared for three literary forms (rhymes, poems, and fairy tales). The aim of the comparison is to evaluate how rhythm could be improved in synthesized speech so that it can be adapted to specific styles or genres. The study is based on the analysis of a corpus of six rhymes, four poems, and two extracts from fairy tales. All texts were recorded by three speakers and generated with two distinct synthesized voices. The rhythmic patterns are compared by analyzing duration in relation to prosodic structure in the various data sets. This approach shows that rhythmic differences between synthesized and natural speech are mostly due to the marking of prosodic structure.


DOI: 10.21437/SpeechProsody.2016-12

Cite as

Delais-Roussarie, E., Lolive, D., Yoo, H., Guennec, D. (2016) Rhythmic patterns and literary genres in synthesized speech. Proc. Speech Prosody 2016, 54-58.

BibTeX
@inproceedings{Delais-Roussarie+2016,
author={Elisabeth Delais-Roussarie and Damien Lolive and Hiyon Yoo and David Guennec},
title={Rhythmic patterns and literary genres in synthesized speech},
year=2016,
booktitle={Speech Prosody 2016},
doi={10.21437/SpeechProsody.2016-12},
url={http://dx.doi.org/10.21437/SpeechProsody.2016-12},
pages={54--58}
}