International Workshop on Spoken Language Translation (IWSLT) 2007
This paper describes the CMU-UKA statistical machine translation systems submitted to the IWSLT 2007 evaluation campaign. Systems were submitted for three language-pairs: Japanese-to-English, Chinese-to-English and Arabic-to-English. All systems were based on a common phrase-based SMT (statistical machine translation) framework but for each language-pair a specific research problem was tackled. For Japanese-to-English we focused on two problems: first, punctuation recovery, and second, how to incorporate topic-knowledge into the translation framework. Our Chinese-to-English submission focused on syntaxaugmented SMT and for the Arabic-to-English task we focused on incorporating morphological-decomposition into the SMT framework. This research strategy enabled us to evaluate a wide variety of approaches which proved effective for the language pairs they were evaluated on.
Full Paper Presentation
Bibliographic reference. Lane, Ian / Zollmann, Andreas / Nguyen, Thuy Linh / Bach, Nguyen / Venugopal, Ashish / Vogel, Stephan / Rottmann, Kay / Zhang, Ying / Waibe, Alex (2007): "The CMU-UKA statistical machine translation systems for IWSLT 2007", In IWSLT-2007, 61-68.