ISCA Archive IWSLT 2010
ISCA Archive IWSLT 2010

The NICT translation system for IWSLT 2010

Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, Eiichiro Sumita

This paper describes NICT's participation in the IWSLT 2010 evaluation campaign for the DIALOG translation (Chinese-English) and the BTEC (French-English) translation shared-tasks. For the DIALOG translation, the main challenge to this task is applying context information during translation. Context information can be used to decide on word choice and also to replace missing information during translation. We applied discriminative reranking using contextual information as additional features. In order to provide more choices for re-ranking, we generated n-best lists from multiple phrase-based statistical machine translation systems that varied in the type of Chinese word segmentation schemes used. We also built a model that merged the phrase tables generated by the different segmentation schemes. Furthermore, we used a lattice-based system combination model to combine the output from different systems. A combination of all of these systems was used to produce the n-best lists for re-ranking. For the BTEC task, a general approach that used latticebased system combination of two systems, a standard phrasebased system and a hierarchical phrase-based system, was taken. We also tried to process some unknown words by replacing them with the same words but different inflections that are known to the system.

Cite as: Goh, C.-L., Watanabe, T., Paul, M., Finch, A., Sumita, E. (2010) The NICT translation system for IWSLT 2010. Proc. International Workshop on Spoken Language Translation (IWSLT 2010), 139-146

  author={Chooi-Ling Goh and Taro Watanabe and Michael Paul and Andrew Finch and Eiichiro Sumita},
  title={{The NICT translation system for IWSLT 2010}},
  booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2010)},