International Workshop on Spoken Language Translation (IWSLT) 2012
This paper reports on FBK's Machine Translation (MT) submissions to the IWSLT 2012 Evaluation on the TED talk translation tasks. We participated in the English-French and the Arabic-, Dutch-, German-, and Turkish-English translation tasks. Several improvements are reported over last year's baselines. In addition to using fill-up combinations of phrase tables for domain adaptation, we explore the use of corpus filtering based on cross-entropy to produce concise and accurate translation and language models. We describe challenges encountered in under-resourced languages (Turkish) and language-specific preprocessing needs.
Bibliographic reference. Ruiz, Nicholas / Bisazza, Arianna / Cattoni, Roldano / Federico, Marcello (2012): "FBK's machine translation systems for IWSLT 2012's TED lectures", In IWSLT-2012, 61-68.