8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

From One Base Form to Multiple Output Styles - Predicting Stylistic Dynamics of Discourse Prosody

Chiu-yu Tseng, Zhao-yu Su

Academia Sinica, Taiwan

We hypothesize that various prosody output styles can be predicted and simulated from one default base form by accounting for contributions from higher-level information to cross-phrase prosodic relationships. Speech materials of four prosody styles were selected: (1) Han and Tang poetry, (2) Tang Ballads and Song poetry, (3) Qin, Tang and Song classic prose, and (4) contemporary TV weather forecasts. F0 contours were analyzed using the Fujisaki model, while quantitative analyses of predictions from the layered-and-cumulative contributions specified by the HPG (Hierarchical Prosodic phrase Grouping) framework [Tseng et al., 2004; 2005; 2006] were performed across styles and speakers. Results confirmed that higher-level contribution is significant across styles; contribution distribution patterns are style specific; more regular prosodic formats require more contribution from higher levels; stylistic dynamics are predictable; and the HPG base form is indeed default.

Bibliographic reference. Tseng, Chiu-yu / Su, Zhao-yu (2007): "From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody", in INTERSPEECH-2007, 110-113.