ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events

Alistair Conkie, Giuseppe Riccardi, Richard C. Rose

A system for automatic recognition of prosodic events in speech utterances has been developed and applied to recognizing accent tones as defined by the tone and break index (ToBI) prosodic labeling standard. Both the acoustic and syntactic modeling portions of the system are described in the paper. The acoustic modeling portion of the system involves representation of ToBI labeled events using hidden Markov models (HMMs) that are defined over a set of prosodic features. The syntactic modeling component involves the prediction of prosodic events based on a stochastic finite state model defined over input labels obtained from a part-of-speech (POS) tagger. The system was evaluated in terms of its ability to recognize pitch accents in a single speaker read speech corpus when the orthographic transcription of the utterance was assumed to be known. It was shown to improve average labeling accuracy over a baseline text{only prosodic labeling system from 84.8% to 88.3%.


doi: 10.21437/Eurospeech.1999-135

Cite as: Conkie, A., Riccardi, G., Rose, R.C. (1999) Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 523-526, doi: 10.21437/Eurospeech.1999-135

@inproceedings{conkie99_eurospeech,
  author={Alistair Conkie and Giuseppe Riccardi and Richard C. Rose},
  title={{Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={523--526},
  doi={10.21437/Eurospeech.1999-135}
}