International Workshop on Spoken Language Translation (IWSLT) 2009

Tokyo, Japan
December 1-2, 2009

The ICT Statistical Machine Translation Systems for the IWSLT 2009

Haitao Mi, Yang Liu, Tian Xia, Xinyan Xiao, Yang Feng, Jun Xie, Hao Xiong, Zhaopeng Tu, Daqi Zheng, Yajuan Lu, Qun Liu

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

This paper describes the ICT Statistical Machine Translation systems that used in the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2009. For this year's evaluation, we participated in the Challenge Task (Chinese-English and English-Chinese) and BTEC Task (Chinese-English). And we mainly focus on one new method to improve single system's translation quality. Specifically, we developed a sentence-similarity based development set selection technique. For each task, we finally submitted the single system who got the maximum BLEU scores on the selected development set. The four single translation systems are based on different techniques: a linguistically syntax-based system, two formally syntax-based systems and a phrase-based system. Typically, we didn't use any rescoring or system combination techniques in this year's evaluation.

Full Paper     Presentation (pdf)

Bibliographic reference.  Mi, Haitao / Liu, Yang / Xia, Tian / Xiao, Xinyan / Feng, Yang / Xie, Jun / Xiong, Hao / Tu, Zhaopeng / Zheng, Daqi / Lu, Yajuan / Liu, Qun (2009): "The ICT statistical machine translation systems for the IWSLT 2009", In IWSLT-2009, 55-59.