Third International Conference on Spoken Language Processing (ICSLP 94)
This paper evaluates the performance of two automatic labelling systems for intonation by using their output to predict the position and type of prominences and boundaries labelled in a ToBI transcription of twelve dialogues. It shows that they both model the prosodic information well, and that improvements in modelling are gained when including segmental information about the duration and energy profiles of the utterance. However, after parameter reduction, the features that survive are segmental rather than F0-related.
Bibliographic reference. Campbell, Nick (1994): "Combining the use of duration and F0 in an automatic analysis of dialogue prosody", In ICSLP-1994, 1111-1114.