International Workshop on Spoken Language Translation (IWSLT) 2012

Hong Kong
December 6-7, 2012

FBK’s Machine Translation Systems for IWSLT 2012’s TED Lectures

Nicholas Ruiz, Arianna Bisazza, Roldano Cattoni, Marcello Federico

Fondazione Bruno Kessler-IRST, Povo (TN), Italy

This paper reports on FBK’s Machine Translation (MT) submissions to the IWSLT 2012 Evaluation on the TED talk translation tasks. We participated in the English-French and the Arabic-, Dutch-, German-, and Turkish-English translation tasks. We report several improvements over last year’s baselines. In addition to using fill-up combinations of phrase tables for domain adaptation, we explore corpus filtering based on cross-entropy to produce concise and accurate translation and language models. We also describe challenges encountered with under-resourced languages (Turkish) and language-specific preprocessing needs.


Bibliographic reference. Ruiz, Nicholas / Bisazza, Arianna / Cattoni, Roldano / Federico, Marcello (2012): "FBK’s machine translation systems for IWSLT 2012’s TED lectures", in IWSLT-2012, 61-68.