ISCA Archive Interspeech 2021
ISCA Archive Interspeech 2021

A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping

Manthan Sharma, Navaneetha Gaddam, Tejas Umesh, Aditya Murthy, Prasanta Kumar Ghosh

Electromyography (EMG) signals have been extensively used to capture facial muscle movements while speaking since they are one of the most closely related bio-signals generated during speech production. In this work, we focus on speech acoustics to EMG prediction. We present a comparative study of ten different EMG signal-based features including Time Domain (TD) features existing in the literature to examine their effectiveness in speech acoustics to EMG inverse (AEI) mapping. We propose a novel feature based on the Hilbert envelope of the filtered EMG signal. The raw EMG signal is reconstructed from these features as well. For the AEI mapping, we use a bi-directional long short-term memory (BLSTM) network in a session-dependent manner. To estimate the raw EMG signal from the EMG features, we use a CNN-BLSTM model comprising of a convolution neural network (CNN) followed by BLSTM layers. AEI mapping performance using the BLSTM network reveals that the Hilbert envelope based feature is predicted from speech with the highest accuracy, among all the features. Therefore, it could be the most representative feature of the underlying muscle activation during speech production. The proposed Hilbert envelope feature, when used together with the existing TD features, improves the raw EMG signal reconstruction performance compared to using the TD features alone.


doi: 10.21437/Interspeech.2021-2240

Cite as: Sharma, M., Gaddam, N., Umesh, T., Murthy, A., Ghosh, P.K. (2021) A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping. Proc. Interspeech 2021, 616-620, doi: 10.21437/Interspeech.2021-2240

@inproceedings{sharma21_interspeech,
  author={Manthan Sharma and Navaneetha Gaddam and Tejas Umesh and Aditya Murthy and Prasanta Kumar Ghosh},
  title={{A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={616--620},
  doi={10.21437/Interspeech.2021-2240}
}