ISCA Archive SpeechProsody 2010

Cross-genre training for automatic prosody classification

Anna Margolis, Mari Ostendorf, Karen Livescu

We consider methods for training a prosodic classifier using labeled training data from a different genre than the one on which the system will be deployed. Two binary tasks are considered: word-level pitch accent and phrase boundary detection. Using radio news and conversational telephone speech, we consider cross-genre training using acoustic and textual features, and find that acoustic features transfer better than text features in most cases. We also find that a single classifier trained from both genres nearly matches genre-dependent performance. We then consider some simple unsupervised domain adaptation approaches, including class proportion adjustment, sample selection bias correction, and feature normalization. With the exception of class proportion adjustment, which is slightly helpful in one case but proves unstable, none of the approaches improve cross-genre performance over the baseline.

Index Terms: prosody recognition, domain adaptation

Cite as: Margolis, A., Ostendorf, M., Livescu, K. (2010) Cross-genre training for automatic prosody classification. Proc. Speech Prosody 2010, paper 113

  author={Anna Margolis and Mari Ostendorf and Karen Livescu},
  title={{Cross-genre training for automatic prosody classification}},
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 113}