15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

The EMG-UKA Corpus for Electromyographic Speech Processing

Michael Wand, Matthias Janke, Tanja Schultz

KIT, Germany

This article gives an overview of the EMG-UKA corpus, a corpus of electromyographic ( EMG) recordings of articulatory activity enabling speech processing (in particular speech recognition and synthesis) based on EMG signals, with the purpose of building Silent Speech interfaces. Data is available in multiple speaking modes, namely audibly spoken, whispered, and silently articulated speech. Besides the EMG data, synchronous acoustic data was additionally recorded to serve as a reference. The corpus comprises 63 recorded sessions from 8 speakers, the total amount of data is 7:32 hours. A trial subset, consisting of 1:52 hours of data, is freely available for download.

Full Paper

Bibliographic reference.  Wand, Michael / Janke, Matthias / Schultz, Tanja (2014): "The EMG-UKA corpus for electromyographic speech processing", In INTERSPEECH-2014, 1593-1597.