ISCA Archive SpeechProsody 2008

Detecting prominence in conversational speech: pitch accent, givenness and focus

Vivek Kumar Rangarajan Sridhar, Ani Nenkova, Shrikanth Narayanan, Dan Jurafsky

The variability and reduction that are characteristic of talking in natural interaction make it very difficult to detect prominence in conversational speech. In this paper, we present analytic studies and automatic detection results for pitch accent and for the realization of information structure phenomena such as givenness and focus. For pitch accent, our conditional random field model combining acoustic and textual features achieves an accuracy of 78%, substantially better than the chance performance of 58%. For givenness and focus, our analysis demonstrates that even in conversational speech there are measurable differences in acoustic properties, and that an automatic detector for these categories can perform significantly above chance.


Cite as: Sridhar, V.K.R., Nenkova, A., Narayanan, S., Jurafsky, D. (2008) Detecting prominence in conversational speech: pitch accent, givenness and focus. Proc. Speech Prosody 2008, 453-456

@inproceedings{sridhar08_speechprosody,
  author={Vivek Kumar Rangarajan Sridhar and Ani Nenkova and Shrikanth Narayanan and Dan Jurafsky},
  title={{Detecting prominence in conversational speech: pitch accent, givenness and focus}},
  year=2008,
  booktitle={Proc. Speech Prosody 2008},
  pages={453--456}
}