In this paper we propose that prosodic structures in spontancous discourse exhibit both linear and superpositional characterictics, and that these reflect the different scopes of multi-tiered emotional and cognitive processes. The multi-tiered structure encompasses the syllable and word level, interphrase movement, and extended pitch level baseline rise and fall. Analysis of the data suggests that integration of the 3 different prosodic levels within an overall prosodic model provides a critical link for the generation of natural-sounding interactive speech systems.
Cite as: Esposito, R., Yang, L.-c. (1999) Levels of prosodic representation in spoken discourse: an empirical approach. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1631-1634, doi: 10.21437/Eurospeech.1999-370
@inproceedings{esposito99_eurospeech, author={Richard Esposito and Li-chiung Yang}, title={{Levels of prosodic representation in spoken discourse: an empirical approach}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={1631--1634}, doi={10.21437/Eurospeech.1999-370} }