ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody

Chiu-yu Tseng, Zhao-yu Su

We hypothesize that various prosody output styles can be predicted and simulated from one default base form by accounting for contributions from higher level information to cross-phrase prosodic relationship. Speech materials of four prosody styles were selected: (1.) Han and Tang poetry, (2.) Tang Ballads and Song poetry, (3.) Qin, Tang and Song classic prose and (4.) contemporary TV weather forecast. F0 contours were analyzed using the Fujisaki model, while quantitative analyses of predictions from layered-and-cumulative contribution specified by the HPG (Hierarchical Prosodic phrase Grouping) framework [Tseng et al, 2004; 2005; 2006] were performed across styles and speakers. Results confirmed that higher level contribution is significant across style; contribution distribution patterns and style specific; more regular prosodic formats require more contribution from higher level; stylistic dynamics are predictable; and the HPG base form is indeed default.


doi: 10.21437/Interspeech.2007-51

Cite as: Tseng, C.-y., Su, Z.-y. (2007) From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody. Proc. Interspeech 2007, 110-113, doi: 10.21437/Interspeech.2007-51

@inproceedings{tseng07_interspeech,
  author={Chiu-yu Tseng and Zhao-yu Su},
  title={{From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={110--113},
  doi={10.21437/Interspeech.2007-51}
}