ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Improving statistical machine translation by classifying and generalizing inflected verb forms

Adrià de Gispert, José B. Mariño, Josep M. Crego

This paper introduces a rule-based classification of single-word and compound verbs into a statistical machine translation approach. By substituting verb forms by the lemma of their head verb, the data sparseness problem caused by highly-inflected languages can be successfully addressed. On the other hand, the information of seen verb forms can be used to generate new translations for unseen verb forms. Translation results for an English to Spanish task are reported, producing a significant performance improvement.


doi: 10.21437/Interspeech.2005-730

Cite as: Gispert, A.d., Mariño, J.B., Crego, J.M. (2005) Improving statistical machine translation by classifying and generalizing inflected verb forms. Proc. Interspeech 2005, 3193-3196, doi: 10.21437/Interspeech.2005-730

@inproceedings{gispert05_interspeech,
  author={Adrià de Gispert and José B. Mariño and Josep M. Crego},
  title={{Improving statistical machine translation by classifying and generalizing inflected verb forms}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={3193--3196},
  doi={10.21437/Interspeech.2005-730}
}