8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks

Dilek Hakkani-Tür (1), Gokhan Tur (2), Michael Levit (1)

(2) SRI International, USA

Information distillation aims to extract relevant pieces of information related to a given query from massive, possibly multilingual, audio and textual document sources. In this paper, we present our approach for using information extraction annotations to augment document retrieval for distillation. We take advantage of the fact that some of the distillation queries can be associated with annotation elements introduced for the NIST Automatic Content Extraction (ACE) task. We experimentally show that using the ACE events to constrain the document set returned by an information retrieval engine significantly improves the precision at various recall rates for two different query templates.

Full Paper

Bibliographic reference.  Hakkani-Tür, Dilek / Tur, Gokhan / Levit, Michael (2007): "Exploiting information extraction annotations for document retrieval in distillation tasks", In INTERSPEECH-2007, 330-333.