14th Annual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Efficient Speech Transcription Through Respeaking

Matthias Sperber (1), Graham Neubig (2), Christian Fügen (3), Satoshi Nakamura (2), Alex Waibel (1)

(1) KIT, Germany
(2) NAIST, Japan
(3) Mobile Technologies GmbH, Germany

We propose a method for efficient off-line speech transcription through respeaking. Speech is segmented into smaller utterances using an initial automatic transcript. Respeaking is performed segment by segment, while confidence filtering helps save supervision effort. We conduct detailed experiments comparing speaking vs. typing, sequential vs. confidence-ordered supervision, and examine the effect of the respeaking word error rate on correction efficiency. Our results demonstrate that the proposed method can not only outperform typing in terms of correction efficiency, but is also much less demanding for the respeakers than traditional respeaking methods, consequently helping to keep costs down.
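
As an illustration of two notions the abstract relies on, the following minimal Python sketch computes a word error rate and orders segments for supervision. It is not the authors' implementation: the function names, the segment dictionary layout, and the `confidence` field are assumptions made for this example, and the word error rate is the standard word-level edit distance divided by the reference length.

```python
from typing import Dict, List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed part of the
    # reference and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def supervision_order(segments: List[Dict], confidence_ordered: bool = True) -> List[Dict]:
    """Return segments in the order they would be shown to the respeaker.

    Assumption: each segment is a dict with a hypothetical "confidence"
    score in [0, 1] taken from the initial automatic transcript.
    Confidence-ordered supervision visits the least confident segments
    first; sequential supervision keeps the original order.
    """
    if confidence_ordered:
        return sorted(segments, key=lambda seg: seg["confidence"])
    return list(segments)


if __name__ == "__main__":
    segments = [
        {"id": 0, "confidence": 0.9},
        {"id": 1, "confidence": 0.2},
        {"id": 2, "confidence": 0.5},
    ]
    print([seg["id"] for seg in supervision_order(segments)])
    print(word_error_rate("a b c d", "a x c"))
```

Under a fixed supervision budget, visiting low-confidence segments first spends the respeaker's effort where the automatic transcript is most likely to be wrong, which is the intuition behind comparing the two orderings.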

Bibliographic reference. Sperber, Matthias / Neubig, Graham / Fügen, Christian / Nakamura, Satoshi / Waibel, Alex (2013): "Efficient speech transcription through respeaking", In INTERSPEECH-2013, 1087-1091.