International Workshop on Spoken Language Translation (IWSLT) 2010

Paris, France
December 2-3, 2010

Hierarchical Phrase-based Translation with Weighted Finite State Transducers

William Byrne

Cambridge University, UK

I will present recent work in statistical machine translation which uses Weighted Finite-State Transducers (WFSTs) to implement a variety of search and estimation algorithms. I will describe HiFST, a lattice-based decoder for hierarchical phrase-based statistical machine translation. The decoder is implemented with standard WFST operations as an alternative to the well-known cube pruning procedure.We find that the use of WFSTs in translation leads to fewer search errors, better parameter optimization, and improved translation performance. We also find that the direct generation of target language lattices under Hiero translation grammars can improve subsequent rescoring procedures, yielding further gains with long-span language models and Minimum Bayes Risk decoding.

Presentation

Bibliographic reference.  Byrne, William (2010): "Hierarchical phrase-based translation with weighted finite state transducers", In IWSLT-2010, 402.