In this paper, we propose a method of formant prediction from pole and bandwidth data, and apply this method to automatically extract F1 and F2 values from a corpus of regional dialect variation in North America that contains 134,000 manual formant measurements. These predicted formants are shown to increase performance over the default formant values from a popular speech analysis package. Finally, we demonstrate that sociolinguistic analysis based on vowel formant data can be conducted reliably using the automatically predicted values, and we argue that sociolinguists should begin to use this methodology in order to be able to analyze larger amounts of data efficiently.
Cite as: Evanini, K., Isard, S., Liberman, M. (2009) Automatic formant extraction for sociolinguistic analysis of large corpora. Proc. Interspeech 2009, 1655-1658, doi: 10.21437/Interspeech.2009-502
@inproceedings{evanini09_interspeech, author={Keelan Evanini and Stephen Isard and Mark Liberman}, title={{Automatic formant extraction for sociolinguistic analysis of large corpora}}, year=2009, booktitle={Proc. Interspeech 2009}, pages={1655--1658}, doi={10.21437/Interspeech.2009-502} }