Sixth International Conference on Spoken Language Processing
The problem of machine translation can be viewed as consisting of two phases: (a) lexical choice phase where appropriate target language lexical items (words or phrases) are chosen for each source language lexical item and (b) reordering phase where the chosen target language lexical items are reordered to produce a meaningful target language string. In earlier work we have shown that finite-state models for lexical choice can be learned from bilingual corpora . In this paper, we focus on stochastic finite-state models for lexical reordering and describe an algorithm to learn them from bilingual corpora. We have developed a stochastic finite-state English-Japanese translation system by composing finite-state lexical choice and lexical reordering model. We have evaluated it using the string edit distance of the translated string from a given reference string. Using this metric, the English-Japanese translation system scored 70.9% on English speech transcriptions.
Bibliographic reference. Bangalore, Srinivas / Riccardi, Giuseppe (2000): "Finite-state models for lexical reordering in spoken language translation", In ICSLP-2000, vol.4, 422-425.