2nd Workshop on Spoken Language Technologies for Under-Resourced Languages

Universiti Sains, Penang, Malaysia
May 3-5, 2010

Exploiting Morphology in Speech Translation with Phrase-Based Finite-State Transducers

Alicia Pérez (1), M. Inés Torres (1), Francisco Casacuberta (2)

(1) Department of Electricity and Electronics, University of the Basque Country, Spain
(2) Instituto Tecnológico de Informática, Technical University of Valencia, Spain

This work implements a novel formulation of phrase-based translation models that uses morpheme-based translation units within a stochastic finite-state framework. The approach is of additional interest for speech translation tasks because it allows the acoustic and translation models to be integrated.

As a further contribution, this is the first paper to address a Basque-to-Spanish speech translation task. For this purpose, a morpheme-based finite-state recognition system is combined with a finite-state transducer that translates phrases of morphemes in the source language into ordinary sequences of words in the target language.

The proposed models were assessed on a limited-domain application task. The phrase-based finite-state translation model using morphemes as translation units performed well, and notable improvements in decoding time were also obtained.

Index Terms: Speech Translation, Stochastic Finite-State Transducers, Morphology

Bibliographic reference. Pérez, Alicia / Torres, M. Inés / Casacuberta, Francisco (2010): "Exploiting morphology in speech translation with phrase-based finite-state transducers", in SLTU-2010, 6-9.