11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Prominence Detection in Swedish Using Syllable Correlates

Samer Al Moubayed, Jonas Beskow

KTH, Sweden

This paper presents an approach to estimating word level prominence in Swedish using syllable level features. The paper discusses the mismatch problem of annotations between word level perceptual prominence and its acoustic correlates, context, and data scarcity. 200 sentences are annotated by 4 speech experts with prominence on 3 levels. A linear model for feature extraction is proposed on a syllable level features, and weights for these features are optimized to match word level annotations. We show that using syllable level features and estimating weights for the acoustic correlates to minimize the word level estimation error gives better detection accuracy compared to word level features, and that both features exceed the baseline accuracy.

Full Paper

Bibliographic reference.  Al Moubayed, Samer / Beskow, Jonas (2010): "Prominence detection in Swedish using syllable correlates", In INTERSPEECH-2010, 1784-1787.