INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

A Continuous Prominence Score based on Acoustic Features

Jean-Philippe Goldman (1), Mathieu Avanzi (2), Antoine Auchlin (1), Anne Catherine Simon (3)

(1) Department of Linguistics, University of Geneva, Switzerland
(2) Chaire de linguistique française, Université de Neuchâtel, Switzerland
(3) Institut Langage & Communication, Centre VALIBEL, UCLouvain, Belgium

Until now, prominence detection has mainly been treated as a binary matter: a syllable is either prominent or not. This contribution develops an automatic procedure for detecting gradual prominence. Based on four prosodic parameters (relative duration, relative F0, F0 movement and pause duration), the system assigns each syllable a gradual prominence score ranging from 0 (non-prominent syllable) to 4 (extra-prominent syllable). The automatic detection (ProsoProm) relies on a manually annotated corpus (18 minutes of speech, or 3669 syllables, annotated by three experts) and is cumulative: the relative weight of each parameter is taken into account to compute a global score for each syllable. The discussion of the results includes a comparison with a similar tool (Analor) and a qualitative analysis of misses and false detections. The agreement between the automatic annotation and the median human annotation reaches a Kappa score of 0.8.
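
The agreement measure mentioned above can be illustrated with a minimal sketch. It assumes unweighted Cohen's kappa over per-syllable labels on the 0 to 4 scale, with the per-syllable median of the three expert labels as the human reference. The abstract does not state which kappa variant was used, so this is an illustration and not the authors' procedure. The function names (`median_labels`, `cohen_kappa`) and the toy labels are hypothetical.

```python
from collections import Counter
from statistics import median


def median_labels(annotations):
    """Per-syllable median across annotators (one label list per annotator)."""
    return [int(median(labels)) for labels in zip(*annotations)]


def cohen_kappa(a, b):
    """Unweighted Cohen's kappa between two equal-length label sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(a)
    # Proportion of syllables on which the two annotations agree.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance, from each annotation's label frequencies.
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both sequences consist of one identical label
    return (observed - expected) / (1 - expected)


# Toy data (hypothetical): three expert annotations and one automatic output.
experts = [
    [0, 1, 2, 0, 3, 4],
    [0, 1, 1, 0, 3, 4],
    [0, 2, 2, 1, 3, 3],
]
automatic = [0, 1, 2, 1, 3, 4]

human = median_labels(experts)
kappa = cohen_kappa(human, automatic)
```

Because the scores are ordinal, a weighted kappa that penalizes a 0 versus 4 disagreement more than a 3 versus 4 disagreement would be a natural alternative; the unweighted form is used here only for simplicity.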

Index Terms: prosody, speech, prominence detection, cumulative prominence, automatic prosodic analysis, expert vs automatic prosodic annotation

Bibliographic reference. Goldman, Jean-Philippe / Avanzi, Mathieu / Auchlin, Antoine / Simon, Anne Catherine (2012): "A continuous prominence score based on acoustic features", in INTERSPEECH-2012, 2414-2417.