Prominence-based Evaluation of L2 Prosody

Heini Kallio, Antti Suni, Päivi Virkkunen, Juraj Šimko


Prosody in terms of word and sentence stress is one of the most difficult features for many second language (L2) speakers to learn and it can be hypothesized that assessing the learner's prosodic abilities could provide a good measure for assessing the learners' spoken language skills in general. Automatic assessment is, however, dependent on reliable automatic analyses of prosodic features for comparing the productions between native (L1) and L2 speech. Here we investigate, whether estimated prosodic prominence levels of syllables can be used to predict the prosodic competence of Finnish learners of Swedish. Syllable level prominence was estimated for 99 L2 and 25 native Swedish utterances using continuous wavelet transform analysis with combinations of F0, energy and duration features. The L2 utterances were assessed by four expert raters using the revised CEFR scale for prosodic features. Correlations of prominence estimates for L2 utterances with estimates for L1 utterances and linguistic stress patterns were used as a measure of prosodic proficiency of the L2 speakers. The results show that these estimates correlate significantly with the assessments of expert raters. Overall, the results provide strong support for the use of the wavelet-based prominence estimation techniques in automatic assessment of L2 proficiency.


 DOI: 10.21437/Interspeech.2018-1873

Cite as: Kallio, H., Suni, A., Virkkunen, P., Šimko, J. (2018) Prominence-based Evaluation of L2 Prosody. Proc. Interspeech 2018, 1838-1842, DOI: 10.21437/Interspeech.2018-1873.


@inproceedings{Kallio2018,
  author={Heini Kallio and Antti Suni and Päivi Virkkunen and Juraj Šimko},
  title={Prominence-based Evaluation of L2 Prosody},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1838--1842},
  doi={10.21437/Interspeech.2018-1873},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1873}
}