9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Methods to Optimize Transcription of On-Line Media

Sarah Conrod (1), Sara Basson (2), Dimitri Kanevsky (2)

(1) Cape Breton University, Canada; (2) IBM T.J. Watson Research Center, USA

This paper outlines the growing need to provide fast and low cost methods for providing transcripts of audio and video media to people who are deaf and hard of hearing. Outlined are three different methods for creating such transcripts including traditional manual transcription and two automatic speech recognition (ASR) methods: a semi-automatic process called shadowing and a web-based automatic transcription tool created by IBM. A pilot examining the three different methods was conducted and the results of these tests are provided and discussed, as well as potential future studies regarding the efficacy and usability of the outputs from the various methods.

Full Paper

Bibliographic reference.  Conrod, Sarah / Basson, Sara / Kanevsky, Dimitri (2008): "Methods to optimize transcription of on-line media", In INTERSPEECH-2008, 203-206.