Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Advances in Lecture Recognition: The ISL RT-06S Evaluation System

Christian Fügen (1), Matthias Wölfel (1), John W. McDonough (1), Shajith Ikbal (1), Florian Kraft (1), Kornel Laskowski (1), Mari Ostendorf (1,2), Sebastian Stüker (1), Kenichi Kumatani (1)

(1) Universität Karlsruhe, Germany; (2) University of Washington, USA

This paper describes the 2006 lecture recognition system developed at the Interactive Systems Laboratories (ISL), for individual headmicrophone (IHM), single distant microphone (SDM), and multiple distant microphones (MDM) conditions. It was evaluated in RT-06S rich transcription meeting evaluation sponsored by the US National Institute of Standards and Technologies (NIST). We describe the principal differences between our current system and those submitted in previous years, namely, improved acoustic and language models, cross adaptation between systems with different front-ends and phoneme sets, and the use of various automatic speech segmentation algorithms. Our system achieved word error rates of 38.5% (53.4%) and 22.9% (32.2%), respectively, on the MDM and IHM conditions of the RT-05S (RT-06S) lecture evaluation set.

Full Paper

Bibliographic reference.  Fügen, Christian / Wölfel, Matthias / McDonough, John W. / Ikbal, Shajith / Kraft, Florian / Laskowski, Kornel / Ostendorf, Mari / Stüker, Sebastian / Kumatani, Kenichi (2006): "Advances in lecture recognition: the ISL RT-06s evaluation system", In INTERSPEECH-2006, paper 1415-Tue3A2O.2.