International Workshop on Spoken Language Translation (IWSLT) 2008
Honolulu, Hawaii, USA
This paper describes the National Institute of Information and Communications Technology/Advanced Telecommunications Research Institute International (NICT/ATR) statistical machine translation (SMT) system used for the IWSLT 2008 evaluation campaign. We participated in the Chinese English (Challenge Task), EnglishChinese (Challenge Task), ChineseEnglish (BTEC Task), ChineseSpanish (BTEC Task), and ChineseEnglishSpanish (PIVOT Task) translation tasks. In the EnglishChinese translation Challenge Task, we focused on exploring various factors for the EnglishChinese translation because the research on the translation of EnglishChinese is scarce compared to the opposite direction. In the ChineseEnglish translation Challenge Task, we employed a novel clustering method, where training sentences similar to the development data in terms of the word error rate formed a cluster. In the pivot translation task, we integrated two strategies for pivot translation by linear interpolation.
Full Paper Presentation (pdf)
Bibliographic reference. Utiyama, Masao / Finch, Andrew / Okuma, Hideo / Paul, Michael / Cao, Hailong / Yamamoto, Hirofumi / Yasuda, Keiji / Sumita, Eiichiro (2008): "The NICT/ATR speech translation system for IWSLT 2008", In IWSLT-2008, 77-84.