International Workshop on Spoken Language Translation (IWSLT) 2012

Hong Kong
December 6-7, 2012

Spoken Language Translation Using Automatically Transcribed Text in Training

Stephan Peitz, Simon Wiesler, Markus Nußbaum-Thom, Hermann Ney

Human Language Technology and Pattern Recognition, Computer Science Department, RWTH Aachen University, Aachen, Germany

In spoken language translation, a machine translation system takes speech as input and translates it into another language. A standard machine translation system is trained on written language data and expects written language as input. In this paper, we propose an approach to close the gap between the output of automatic speech recognition and the input of machine translation by training the translation system on automatically transcribed speech. In our experiments, we show improvements of up to 0.9 BLEU points on the IWSLT 2012 English-to-French speech translation task.

Bibliographic reference. Peitz, Stephan / Wiesler, Simon / Nußbaum-Thom, Markus / Ney, Hermann (2012): "Spoken language translation using automatically transcribed text in training", in IWSLT-2012, 276-283.