ISCA Archive TAL 2006
ISCA Archive TAL 2006

Prosodic fillers and discourse markers–discourse prosody and text prediction

Chiu-yu Tseng, Zhao-yu Su, Chun-Hsiang Chang, Chia-hung Tai

Mandarin Chinese fluent speech prosody is characterized by a hierarchical multiple-phrase structure that specifies how speech paragraphs are constituted via Prosodic Phrase Grouping. Hence we view spoken discourse prosody as yet another higher node treats PGs (Prosodic Phrase Groups) as sister constituents. The goals of present study are two fold: one is to study how speech paragraphs are connected in speech flow; another is to derive prosody prediction from text analysis. Investigating cross-phrase F0 range narrowing and F0 reset with boundary information, we further conducted corresponding text analysis for prosody prediction. Results revealed two types of PG connectors, one is redundant Prosodic Fillers (PF) that are mostly duration triggered and manifested through narrowed F0 ranges; another is obligatory Discourse Markers (DM) that are lexically and/or syntactically triggered and manifested through widened F0 ranges and resets. Both could be predicted from text analysis. We believe this is a significant step forward towards understanding the organization of discourse prosody. It could also be applied to speech synthesis and/or unlimited TTS for prosody enhancement.


Cite as: Tseng, C.-y., Su, Z.-y., Chang, C.-H., Tai, C.-h. (2006) Prosodic fillers and discourse markers–discourse prosody and text prediction. Proc. 2nd International Symposium on Tonal Aspects of Languages (TAL 2006), 108-113

@inproceedings{tseng06b_tal,
  author={Chiu-yu Tseng and Zhao-yu Su and Chun-Hsiang Chang and Chia-hung Tai},
  title={{Prosodic fillers and discourse markers–discourse prosody and text prediction}},
  year=2006,
  booktitle={Proc. 2nd International Symposium on Tonal Aspects of Languages (TAL 2006)},
  pages={108--113}
}