ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Utterance-level normalization for relative articulation rate analysis

Tuomo Saarni, Jussi Hakokari, Jouni Isoaho, Tapio Salakoski

This study describes a computational method for studying variation in articulation rate in a qualitatively mixed speech corpus. The method works within the scope of individual utterances, replacing each single speech sound's time information with a coefficient based on its duration relative to its environment. It can be used to generalize and determine points of acceleration and deceleration in articulation at the phone level, even when the general speaking rate varies greatly due to speaker, style, and utterance length related effects. To demonstrate the usability of the proposed method, we track observed deceleration of articulation rate (a form of final lengthening) towards the ends of utterances in a linguistically uncontrolled Finnish-language speech corpus with several speakers and styles.


doi: 10.21437/Interspeech.2008-161

Cite as: Saarni, T., Hakokari, J., Isoaho, J., Salakoski, T. (2008) Utterance-level normalization for relative articulation rate analysis. Proc. Interspeech 2008, 538-541, doi: 10.21437/Interspeech.2008-161

@inproceedings{saarni08_interspeech,
  author={Tuomo Saarni and Jussi Hakokari and Jouni Isoaho and Tapio Salakoski},
  title={{Utterance-level normalization for relative articulation rate analysis}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={538--541},
  doi={10.21437/Interspeech.2008-161}
}