ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

A multi-level context-dependent prosodic model applied to durational modeling

Nicolas Obin, Xavier Rodet, Anne Lacheret-Dujour

We present in this article a multi-level prosodic model based on the estimation of prosodic parameters on a set of well defined linguistic units. Different linguistic units are used to represent different scales of prosodic variations (local and global forms) and thus to estimate the linguistic factors that can explain the variations of prosodic parameters independently on each level. This model is applied to the modeling of syllable-based durational parameters on two read speech corpora — laboratory and acted speech. Compared to a syllable-based baseline model, the proposed approach improves performance in terms of the temporal organization of the predicted durations (correlation score) and reduces model’s complexity, when showing comparable performance in terms of relative prediction error.


doi: 10.21437/Interspeech.2009-188

Cite as: Obin, N., Rodet, X., Lacheret-Dujour, A. (2009) A multi-level context-dependent prosodic model applied to durational modeling. Proc. Interspeech 2009, 512-515, doi: 10.21437/Interspeech.2009-188

@inproceedings{obin09_interspeech,
  author={Nicolas Obin and Xavier Rodet and Anne Lacheret-Dujour},
  title={{A multi-level context-dependent prosodic model applied to durational modeling}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={512--515},
  doi={10.21437/Interspeech.2009-188}
}