ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Automatic formant extraction for sociolinguistic analysis of large corpora

Keelan Evanini, Stephen Isard, Mark Liberman

In this paper, we propose a method of formant prediction from pole and bandwidth data, and apply this method to automatically extract F1 and F2 values from a corpus of regional dialect variation in North America that contains 134,000 manual formant measurements. These predicted formants are shown to increase performance over the default formant values from a popular speech analysis package. Finally, we demonstrate that sociolinguistic analysis based on vowel formant data can be conducted reliably using the automatically predicted values, and we argue that sociolinguists should begin to use this methodology in order to be able to analyze larger amounts of data efficiently.


doi: 10.21437/Interspeech.2009-502

Cite as: Evanini, K., Isard, S., Liberman, M. (2009) Automatic formant extraction for sociolinguistic analysis of large corpora. Proc. Interspeech 2009, 1655-1658, doi: 10.21437/Interspeech.2009-502

@inproceedings{evanini09_interspeech,
  author={Keelan Evanini and Stephen Isard and Mark Liberman},
  title={{Automatic formant extraction for sociolinguistic analysis of large corpora}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1655--1658},
  doi={10.21437/Interspeech.2009-502}
}