ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face

Geoffrey S. Meltzner, Jason Sroka, James T. Heaton, L. Donald Gilmore, Glen Colby, Serge Roy, Nancy Chen, Carlo J. De Luca

We report automatic speech recognition accuracy for individual words using eleven surface electromyographic (sEMG) recording locations on the face and neck during three speaking modes: vocalized, mouthed, and mentally rehearsed. An HMM based recognition system was trained and tested on a 65 word vocabulary produced by 9 American English speakers in all three speaking modes. Our results indicate high sEMG-based recognition accuracy for the vocalized and mouthed speaking modes (mean rates of 92.1% and 86.7% respectively), but an inability to conduct recognition on mentally rehearsed speech due to a lack of sufficient sEMG activity.


doi: 10.21437/Interspeech.2008-661

Cite as: Meltzner, G.S., Sroka, J., Heaton, J.T., Gilmore, L.D., Colby, G., Roy, S., Chen, N., Luca, C.J.D. (2008) Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face. Proc. Interspeech 2008, 2667-2670, doi: 10.21437/Interspeech.2008-661

@inproceedings{meltzner08_interspeech,
  author={Geoffrey S. Meltzner and Jason Sroka and James T. Heaton and L. Donald Gilmore and Glen Colby and Serge Roy and Nancy Chen and Carlo J. De Luca},
  title={{Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2667--2670},
  doi={10.21437/Interspeech.2008-661}
}