ISCA Archive Interspeech 2021
ISCA Archive Interspeech 2021

Optimizing an Automatic Creaky Voice Detection Method for Australian English Speaking Females

Hannah White, Joshua Penney, Andy Gibson, Anita Szakay, Felicity Cox

Creaky voice is a nonmodal phonation type that has various linguistic and sociolinguistic functions. Manually annotating creaky voice for phonetic analysis is time-consuming and labor-intensive. In recent years, automatic tools for detecting creaky voice have been proposed, which present the possibility for easier, faster and more consistent creak identification. One of these proposed tools is a Creak Detector algorithm that uses an automatic neural network taking its input from several acoustic cues to identify creaky voice. Previous work has suggested that the creak probability threshold at which this tool determines an instance to be creaky may vary depending on the speaker population. The present study investigates the optimal creak detection threshold for female Australian English speakers.

Results show further support for the practice of first finding the optimal threshold when using the Creak Detection algorithm on new data sets. Additionally, results show that accuracy of creaky voice detection using the Creak Detection algorithm can be significantly improved by excluding non-sonorant data.


doi: 10.21437/Interspeech.2021-711

Cite as: White, H., Penney, J., Gibson, A., Szakay, A., Cox, F. (2021) Optimizing an Automatic Creaky Voice Detection Method for Australian English Speaking Females. Proc. Interspeech 2021, 1384-1388, doi: 10.21437/Interspeech.2021-711

@inproceedings{white21_interspeech,
  author={Hannah White and Joshua Penney and Andy Gibson and Anita Szakay and Felicity Cox},
  title={{Optimizing an Automatic Creaky Voice Detection Method for Australian English Speaking Females}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={1384--1388},
  doi={10.21437/Interspeech.2021-711}
}