ISCA Archive WOCCI 2017
ISCA Archive WOCCI 2017

Automatic recognition of children's read speech for stuttering application

Sadeen Alharbi, Anthony J H Simons, Shelagh Brumfitt, Phil Green

Stuttering is a common speech disfluency that may persist into adulthood if not treated in its early stages. Techniques from spoken language understanding may be applied to provide automated diagnoses of stuttering from voice recordings; however, there are several difficulties, including the lack of training data involving young children and the high dimensionality of these data. This study investigates how automatic speech recognition (ASR) could help clinicians by providing a tool that automatically recognises stuttering events and provides a useful written transcription of what was said. In addition, to enhance the performance of ASR and to alleviate the lack of stuttering data, this study examines the effect of augmenting the language model with artificially generated data. The performance of the ASR tool with and without language model augmentation is compared. Following language model augmentation, the ASR tool's performance improved recall from 38% to 62.2% and precision from 56.58% to 71%. When mis-recognised events are more coarsely classified as stuttering/non-stuttering events, the performance improves up to 73% in recall and 84% in precision. Although the obtained results are not perfect, they map to fairly robust stutter/non-stutter decision boundaries.

doi: 10.21437/WOCCI.2017-1

Cite as: Alharbi, S., Simons, A.J.H., Brumfitt, S., Green, P. (2017) Automatic recognition of children's read speech for stuttering application. Proc. 6th Workshop on Child Computer Interaction (WOCCI 2017), 1-6, doi: 10.21437/WOCCI.2017-1

  author={Sadeen Alharbi and Anthony J H Simons and Shelagh Brumfitt and Phil Green},
  title={{Automatic recognition of children's read speech for stuttering application}},
  booktitle={Proc. 6th Workshop on Child Computer Interaction (WOCCI 2017)},