Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Recognition of Classroom Lectures in European Portuguese

Isabel Trancoso (1), Ricardo Nunes (2), Lus Neves (2), Cu Viana (3), Helena Moniz (3), Diamantino Caseiro (1), Ana Isabel Mata (3)

(1) L2F INESC-ID/IST, Portugal; (2) L2F INESC-ID, Portugal; (3) CLUL, Portugal

Classroom lectures may be very challenging for automatic speech recognizers, because the vocabulary may be very specific and the speaking style very spontaneous. Our first experiments using a recognizer trained for Broadcast News resulted in word error rates near 60%, clearly confirming the need for adaptation to the specific topic of the lectures, on one hand, and for better strategies for handling spontaneous speech. This paper describes our efforts in these two directions: the different domain adaptation steps that lowered the error rate to 45%, with very little transcribed adaptation material, and the exploratory study of spontaneous speech phenomena in European Portuguese, namely concerning filled pauses.

Full Paper

Bibliographic reference.  Trancoso, Isabel / Nunes, Ricardo / Neves, Lus / Viana, Cu / Moniz, Helena / Caseiro, Diamantino / Mata, Ana Isabel (2006): "Recognition of classroom lectures in european portuguese", In INTERSPEECH-2006, paper 1524-Mon2A2O.3.