ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Machine translation in continuous space

Ruhi Sarikaya, Yonggang Deng, Mohamed Afify, Brian Kingsbury, Yuqing Gao

We present a different perspective on the machine translation problem that relies upon continuous-space probabilistic models for words and phrases. Within this perspective we propose a method called Tied-Mixture Machine Translation (TMMT) that uses a trainable parametric model employing Gaussian mixture probability density functions to represent word- and phrase-pairs. In the new perspective, machine translation is treated in the same way as acoustic modeling in speech recognition. This new treatment carries several potential advantages that may improve state-of-theart machine translation systems, including better generalization to unseen events; adaptation to new domains, languages, genres, and speakers via methods such as Maximum-Likelihood Linear Regression (MLLR); and improved discrimination through discriminative training methods such as Maximum Mutual Information Estimation (MMIE). Our goal in this paper, however, is to introduce the new approach and demonstrate its viability, leaving investigation of some of the potential advantages to future work. To this end, we report some preliminary experiments demonstrating the viability of the proposed method.

doi: 10.21437/Interspeech.2008-599

Cite as: Sarikaya, R., Deng, Y., Afify, M., Kingsbury, B., Gao, Y. (2008) Machine translation in continuous space. Proc. Interspeech 2008, 2350-2353, doi: 10.21437/Interspeech.2008-599

  author={Ruhi Sarikaya and Yonggang Deng and Mohamed Afify and Brian Kingsbury and Yuqing Gao},
  title={{Machine translation in continuous space}},
  booktitle={Proc. Interspeech 2008},