The CUHK Dysarthric Speech Recognition Systems for English and Cantonese

Shoukang Hu, Shansong Liu, Heng Fai Chang, Mengzhe Geng, Jiani Chen, Lau Wing Chung, To Ka Hei, Jianwei Yu, Ka Ho Wong, Xunying Liu, Helen Meng


Speech disorders affect many people around the world and introduce a negative impact on their quality of life. Dysarthria is a neural-motor speech disorder that obstructs the normal production of speech. Current automatic speech recognition (ASR) systems are developed for normal speech. They are not suitable for accurate recognition of disordered speech. To the best of our knowledge, the majority of disordered speech recognition systems developed to date are for English. In this paper, we present two disordered speech recognition systems for both English and Cantonese. Both systems demonstrate competitive performance when compared with the Google speech recognition API and human recognition results.


Cite as: Hu, S., Liu, S., Chang, H.F., Geng, M., Chen, J., Chung, L.W., Hei, T.K., Yu, J., Wong, K.H., Liu, X., Meng, H. (2019) The CUHK Dysarthric Speech Recognition Systems for English and Cantonese. Proc. Interspeech 2019, 3669-3670.


@inproceedings{Hu2019,
  author={Shoukang Hu and Shansong Liu and Heng Fai Chang and Mengzhe Geng and Jiani Chen and Lau Wing Chung and To Ka Hei and Jianwei Yu and Ka Ho Wong and Xunying Liu and Helen Meng},
  title={{The CUHK Dysarthric Speech Recognition Systems for English and Cantonese}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={3669--3670}
}