ISCA Archive Interspeech 2009

Universal access: speech recognition for talkers with spastic dysarthria

Harsh Vardhan Sharma, Mark Hasegawa-Johnson

This paper describes the results of our experiments in small- and medium-vocabulary dysarthric speech recognition, using the database being recorded by our group under the Universal Access initiative. We develop and test speaker-dependent, word- and phone-level speech recognizers utilizing the hidden Markov model (HMM) architecture; the models are trained exclusively on dysarthric speech produced by individuals diagnosed with cerebral palsy. The experiments indicate that (a) different system configurations (word-based vs. phone-based, number of states per HMM, number of Gaussian components per state-specific observation probability density, etc.) give useful performance (in terms of recognition accuracy) for different speakers and different task vocabularies, and (b) for very low intelligibility subjects, speech recognition outperforms human listeners on recognizing dysarthric speech.


doi: 10.21437/Interspeech.2009-444

Cite as: Sharma, H.V., Hasegawa-Johnson, M. (2009) Universal access: speech recognition for talkers with spastic dysarthria. Proc. Interspeech 2009, 1451-1454, doi: 10.21437/Interspeech.2009-444

@inproceedings{sharma09_interspeech,
  author={Harsh Vardhan Sharma and Mark Hasegawa-Johnson},
  title={{Universal access: speech recognition for talkers with spastic dysarthria}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1451--1454},
  doi={10.21437/Interspeech.2009-444}
}