ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models

Myung Jong Kim, Joohong Yoo, Hoirin Kim

Dysarthria is a motor speech disorder that impairs the physical production of speech. Modern automatic speech recognition for normal speech is ineffective for dysarthric speech due to the large mismatch of acoustic characteristics. In this paper, a new speaker adap- tation scheme is proposed to reduce the mismatch. First, a speaker with dysarthria is classified into one of the pre-defined severitylevels, and then an initial model to be adapted is selected depending on their severity-level. The candidates of an initial model are generated using dysarthric speech associated with their labeled severitylevel in the training phase. Finally, speaker adaptation is applied to the selected initial model. Evaluation of the proposed method on a database of several hundred words for 31 speakers with moderate to mild dysarthria showed that the proposed approach provides substantial improvement over the conventional speaker-adaptive system when a small amount of adaptation data is available.


doi: 10.21437/Interspeech.2013-320

Cite as: Kim, M.J., Yoo, J., Kim, H. (2013) Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models. Proc. Interspeech 2013, 3622-3626, doi: 10.21437/Interspeech.2013-320

@inproceedings{kim13d_interspeech,
  author={Myung Jong Kim and Joohong Yoo and Hoirin Kim},
  title={{Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={3622--3626},
  doi={10.21437/Interspeech.2013-320}
}