Automatic Analysis of Phonetic Speech Style Dimensions

Neville Ryant, Mark Liberman


We apply automated analysis methods to create a multidimensional characterization of the prosodic characteristics of a large variety of speech datasets, with the goal of developing a general framework for comparing prosodic styles. Our datasets span styles including conversation, fluent reading, extemporized narratives, political speech, and advertisements; we compare several different languages including English, Spanish, and Chinese; and the features we extract are based on the joint distributions of F0 and amplitude values and sequences, speech and silence segment durations, syllable durations, and modulation spectra. Rather than focus on the acoustic correlates of a small number of discrete and mutually exclusive categories, we aim to characterize the space in which diverse speech styles live.


DOI: 10.21437/Interspeech.2016-1355

Cite as

Ryant, N., Liberman, M. (2016) Automatic Analysis of Phonetic Speech Style Dimensions. Proc. Interspeech 2016, 77-81.

Bibtex
@inproceedings{Ryant+2016,
author={Neville Ryant and Mark Liberman},
title={Automatic Analysis of Phonetic Speech Style Dimensions},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1355},
url={http://dx.doi.org/10.21437/Interspeech.2016-1355},
pages={77--81}
}