International Workshop on Spoken Language Translation (IWSLT) 2010

Paris, France
December 2-3, 2010

If I Only Had a Parser: Poor Man's Syntax for Hierarchical Machine Translation

David Vilar, Daniel Stein, Stephan Peitz, Hermann Ney

Lehrstuhl für Informatik 6, RWTH Aachen University, Aachen, Germany

In the last few years, several enhancements for the hierarchical phrase-based translation model have been proposed. They aim to include additional syntactic information in the translation process in order to achieve better fluency in the generated output.
   In this work we review and compare three such methods: parsematch, soft syntactic labels and string-to-dependency. Our goal is to find out whether these models complement each other or whether they address the same deficiencies in the translation process. Furthermore, we present a novel method for extending the translation model in the same direction without the need for parse trees, which may not be available for some languages. Our approach is based only on automatic clustering of phrases and requires no additional information. Our findings show that this approach achieves results similar to those obtained with syntax models.

Bibliographic reference. Vilar, David / Stein, Daniel / Peitz, Stephan / Ney, Hermann (2010): "If I only had a parser: poor man's syntax for hierarchical machine translation", In IWSLT-2010, 345-352.