ISCA Archive Interspeech 2008

Integration of metamodel and acoustic model for speech recognition

Hironori Matsumasa, Tetsuya Takiguchi, Yasuo Ariki, Ichao Li, Toshitaka Nakabayashi

We investigated speech recognition for a person with an articulation disorder resulting from athetoid cerebral palsy. The articulation of the first speech tends to become unstable due to strain on speech-related muscles, which degrades speech recognition. To address this, we previously proposed a robust feature extraction method based on PCA (Principal Component Analysis) instead of MFCC [1]. In this paper, we discuss our effort to integrate a metamodel [2] with an acoustic-model approach. The metamodel is a technique for incorporating a model of a speaker's confusion matrix into the ASR process in such a way as to increase recognition accuracy. The effectiveness of the proposed integration was confirmed by word recognition experiments.
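As a rough illustration of how a speaker's confusion matrix might be combined with acoustic-model scores, the following minimal sketch uses a log-linear weighting at the word level. It is not the method of the paper or of [2]: the function name `integrate_scores`, the `weight` and `floor` parameters, the word-level granularity, and the toy counts are all assumptions made for this example.

```python
import math


def integrate_scores(acoustic_post, confusion, recognized, weight=0.5, floor=1e-6):
    """Pick the intended word by combining two scores log-linearly.

    acoustic_post: dict mapping candidate word -> P(word | audio), assumed
                   to come from an acoustic model (hypothetical input).
    confusion:     dict mapping intended word -> {recognized word: count},
                   a toy stand-in for the speaker's confusion matrix.
    recognized:    the word output by the baseline recognizer.
    weight:        interpolation weight on the acoustic score (assumption).
    floor:         small probability used to avoid log(0).
    """
    scores = {}
    for word, p_am in acoustic_post.items():
        row = confusion.get(word, {})
        total = sum(row.values())
        # Estimate P(recognized | intended = word) from the confusion counts.
        p_meta = row.get(recognized, 0) / total if total else 0.0
        scores[word] = (weight * math.log(max(p_am, floor))
                        + (1.0 - weight) * math.log(max(p_meta, floor)))
    best = max(scores, key=scores.get)
    return best, scores


# Toy data: this speaker's "bat" is usually recognized as "pat".
acoustic_post = {"pat": 0.55, "bat": 0.45}
confusion = {
    "bat": {"pat": 8, "bat": 2},
    "pat": {"pat": 5, "pad": 5},
}

best, scores = integrate_scores(acoustic_post, confusion, recognized="pat")
print(best)
```

With these toy numbers the acoustic scores alone slightly favor "pat", while the confusion counts shift the combined decision toward "bat"; setting `weight=1.0` reduces the sketch to the acoustic-only decision.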

[1] H. Matsumasa, T. Takiguchi, Y. Ariki, I. Li, and T. Nakabayashi, "PCA-Based Feature Extraction for Fluctuation in Speaking Style of Articulation Disorders," INTERSPEECH-2007, pp. 1150-1153, 2007 (ISCA Archive).

[2] O. C. Morales and S. Cox, "Modelling Confusion Matrices to Improve Speech Recognition Accuracy, with an Application to Dysarthric Speech," INTERSPEECH-2007, pp. 1565-1568, 2007 (ISCA Archive).

doi: 10.21437/Interspeech.2008-583

Cite as: Matsumasa, H., Takiguchi, T., Ariki, Y., Li, I., Nakabayashi, T. (2008) Integration of metamodel and acoustic model for speech recognition. Proc. Interspeech 2008, 2234-2237, doi: 10.21437/Interspeech.2008-583

  author={Hironori Matsumasa and Tetsuya Takiguchi and Yasuo Ariki and Ichao Li and Toshitaka Nakabayashi},
  title={{Integration of metamodel and acoustic model for speech recognition}},
  booktitle={Proc. Interspeech 2008},