ISCA Archive Interspeech 2009

Connecting rhythm and prominence in automatic ESL pronunciation scoring

Emily Nava, Joseph Tepperman, Louis Goldstein, Maria Luisa Zubizarreta, Shrikanth S. Narayanan

Past studies have shown that a native Spanish speaker’s use of phrasal prominence is a good indicator of her level of English prosody acquisition. Because of the cross-linguistic differences in the organization of phrasal prominence and durational contrasts, we hypothesize that speakers with English-like prominence in their L2 speech have also acquired English-like rhythm. Statistics from a corpus of native and nonnative English confirm that speakers with English-like phrasal prominence are also the ones who use English-like rhythm. Additionally, two methods of automatic score generation based on vowel durations each yield a correlation of at least 0.6 between the automatic scores and subjective scores for phrasal prominence. These findings suggest that simple vowel duration measures obtained from standard automatic speech recognition methods can be salient cues for estimating subjective scores of prosodic acquisition, and of pronunciation in general.
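The abstract does not specify the two scoring methods, so the following is only a rough sketch of the general idea: derive a per-speaker rhythm measure from vowel durations and check how well it correlates with subjective prominence scores. It assumes the normalized Pairwise Variability Index (nPVI), a common rhythm metric that is not confirmed to be one of the paper's measures. The function names `npvi` and `pearson` and all numeric values are hypothetical.

```python
from math import sqrt


def npvi(durations):
    """Normalized Pairwise Variability Index over successive vowel durations.

    Assumption: nPVI stands in for a duration-based rhythm score; the
    paper's own scoring methods are not described in the abstract.
    """
    if len(durations) < 2:
        raise ValueError("need at least two vowel durations")
    # Each term compares neighbouring vowels, normalized by their mean length.
    terms = [abs(a - b) / ((a + b) / 2)
             for a, b in zip(durations, durations[1:])]
    return 100 * sum(terms) / len(terms)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences of length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (spread_x * spread_y)


if __name__ == "__main__":
    # Made-up vowel durations in seconds, one list per hypothetical speaker.
    # These are illustration values only, not data from the paper's corpus.
    speakers = [
        [0.08, 0.21, 0.07, 0.18],
        [0.11, 0.13, 0.12, 0.14],
        [0.09, 0.17, 0.10, 0.19],
    ]
    # Made-up subjective prominence scores for the same three speakers.
    subjective = [4.5, 2.0, 3.5]

    automatic = [npvi(d) for d in speakers]
    print("nPVI per speaker:", automatic)
    print("correlation with subjective scores:", pearson(automatic, subjective))
```

In practice the vowel durations would come from the phone-level time alignments produced by a speech recognizer, as the abstract suggests, rather than from hand-entered lists.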

doi: 10.21437/Interspeech.2009-237

Cite as: Nava, E., Tepperman, J., Goldstein, L., Zubizarreta, M.L., Narayanan, S.S. (2009) Connecting rhythm and prominence in automatic ESL pronunciation scoring. Proc. Interspeech 2009, 684-687, doi: 10.21437/Interspeech.2009-237

  author={Emily Nava and Joseph Tepperman and Louis Goldstein and Maria Luisa Zubizarreta and Shrikanth S. Narayanan},
  title={{Connecting rhythm and prominence in automatic ESL pronunciation scoring}},
  booktitle={Proc. Interspeech 2009},