We present the PERCEPT-R corpus, a labeled corpus of child speakers of American English with typical speech and residual speech sound disorders affecting rhotics. We demonstrate the utility of age-and-gender normalized formants extracted from PERCEPT-R in training support vector classifiers to predict ground-truth perceptual judgments of "rhotic” (i.e., dialect-typical) and "derhotic” phones for novel speakers (mean of participant-specific f-metrics = .83; SD = .18, N = 281).
Cite as: Benway, N., Preston, J.L., Hitchcock, E., Salekin, A., Sharma, H., McAllister, T. (2022) PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/. Proc. Interspeech 2022, 3648-3652, doi: 10.21437/Interspeech.2022-10785
@inproceedings{benway22_interspeech, author={Nina Benway and Jonathan L. Preston and Elaine Hitchcock and Asif Salekin and Harshit Sharma and Tara McAllister}, title={{PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/}}, year=2022, booktitle={Proc. Interspeech 2022}, pages={3648--3652}, doi={10.21437/Interspeech.2022-10785} }