ISCA Archive IWSLT 2005

Machine translation evaluation inside QARLA

Jesús Giménez, Enrique Amigó, Chiori Hori

In this work we present the fundamentals of the IQMT framework for MT evaluation. IQMT offers a common workbench on which existing evaluation metrics can be utilized. We suggest the IQ measure and test it on the Chinese-to-English data from the IWSLT 2004 Evaluation Campaign. We show how the correlation with human assessments at the system level improves substantially for most individual metrics. Moreover, IQMT allows several metrics to be combined robustly, avoiding scaling problems and metric weightings. Several metric combinations were tried, but correlations did not improve significantly further.

Cite as: Giménez, J., Amigó, E., Hori, C. (2005) Machine translation evaluation inside QARLA. Proc. International Workshop on Spoken Language Translation (IWSLT 2005), 189-196

  author={Jesús Giménez and Enrique Amigó and Chiori Hori},
  title={{Machine translation evaluation inside QARLA}},
  booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2005)},