8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Wavelet-based Front-End for Electromyographic Speech Recognition

Michael Wand, Szu-Chen Stan Jou, Tanja Schultz

Carnegie Mellon University, USA

In this paper we present our investigations on the potential of wavelet-based preprocessing for surface electromyographic speech recognition. We implemented several variants of the Discrete Wavelet Transform and applied them to electromyographical data. First we examined different transforms with various filters and decomposition levels and found that the Redundant Discrete Wavelet Transform performs the best among all tested wavelet transforms. Furthermore, we compared the best wavelet transform to our EMG optimized spectral- and time-domain features. The results showed that the best wavelet transform slightly outperforms the optimized features with 30.9% word error rate compared to 32% for the optimized EMG spectral and time-domain features. Both numbers were achieved on a 108 word vocabulary test set using phone based acoustic models trained on continuously spoken speech captured by EMG.

Full Paper

Bibliographic reference.  Wand, Michael / Jou, Szu-Chen Stan / Schultz, Tanja (2007): "Wavelet-based front-end for electromyographic speech recognition", In INTERSPEECH-2007, 686-689.