12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Automatic Generation of Listening Comprehension Learning Material in European Portuguese

Thomas Pellegrini (1), Rui Correia (1), Isabel Trancoso (1), Jorge Baptista (2), Nuno Mamede (1)

(1) INESC-ID Lisboa, Portugal
(2) Universidade do Algarve, Portugal

The goal of this work is the automatic selection of materials for a listening comprehension game. We would like to select automatically transcribed sentences from recent broadcast news corpora, in order to gather material for the games with little human effort. The recognized words are used as the ground solution of the exercises, thus sentences with misrecognitions need to be filtered out. Our experiments confirmed the feasibility of the filter chain that automatically selects sentences, although harder confidence thresholds may be needed. Together with the correct words, wrong candidates, namely distractors, are also needed to build the exercises. Two techniques of distractor generation are presented, either based on the confusion networks produced by the recognizer, or on phonetic distances. The experiments confirmed the complementarity of both approaches.

Full Paper

Bibliographic reference.  Pellegrini, Thomas / Correia, Rui / Trancoso, Isabel / Baptista, Jorge / Mamede, Nuno (2011): "Automatic generation of listening comprehension learning material in european portuguese", In INTERSPEECH-2011, 1629-1632.