ISCA Archive IWSLT 2008
ISCA Archive IWSLT 2008

The tÜbİTAK-UEKAE statistical machine translation system for IWSLT 2008

Coşkun Mermer, Hamza Kaya, Ömer Farukhan Güneş, Mehmet Uğur Doğan

We present the TÜBİTAK-UEKAE statistical machine translation system that participated in the IWSLT 2008 evaluation campaign. Our system is based on the opensource phrase-based statistical machine translation software Moses. Additionally, phrase-table augmentation is applied to maximize source language coverage; lexical approximation is applied to replace out-of-vocabulary words with known words prior to decoding; and automatic punctuation insertion is improved. We describe the preprocessing and postprocessing steps and our training and decoding procedures. Results are presented on our participation in the classical Arabic-English and Chinese-English tasks as well as the new Chinese-Spanish direct and Chinese-English-Spanish pivot translation tasks.


Cite as: Mermer, C., Kaya, H., Güneş, Ö.F., Doğan, M.U. (2008) The tÜbİTAK-UEKAE statistical machine translation system for IWSLT 2008. Proc. International Workshop on Spoken Language Translation (IWSLT 2008), 138-142

@inproceedings{mermer08_iwslt,
  author={Coşkun Mermer and Hamza Kaya and Ömer Farukhan Güneş and Mehmet Uğur Doğan},
  title={{The tÜbİTAK-UEKAE statistical machine translation system for IWSLT 2008}},
  year=2008,
  booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2008)},
  pages={138--142}
}