International Workshop on Spoken Language Translation (IWSLT) 2012

Hong Kong
December 6-7, 2012

The RWTH Aachen Speech Recognition and Machine Translation System for IWSLT 2012

Stephan Peitz, Saab Mansour, Markus Freitag, Minwei Feng, Matthias Huck, Joern Wuebker, Malte Nuhn, Markus Nußbaum-Thom, Hermann Ney

Human Language Technology and Pattern Recognition Group, Computer Science Department, RWTH Aachen University, Aachen, Germany

In this paper, the automatic speech recognition (ASR) and statistical machine translation (SMT) systems of RWTH Aachen University developed for the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2012 are presented. We participated in the ASR (English), MT (English-French, Arabic-English, Chinese- English, German-English) and SLT (English-French) tracks. For the MT track both hierarchical and phrase-based SMT decoders are applied. A number of different techniques are evaluated in the MT and SLT tracks, including domain adaptation via data selection, translation model interpolation, phrase training for hierarchical and phrase-based systems, additional reordering model, word class language model, various Arabic and Chinese segmentation methods, postprocessing of speech recognition output with an SMT system, and system combination. By application of these methods we can show considerable improvements over the respective baseline systems.

Full Paper    Presentation

Bibliographic reference.  Peitz, Stephan / Mansour, Saab / Freitag, Markus / Feng, Minwei / Huck, Matthias / Wuebker, Joern / Nuhn, Malte / Nußbaum-Thom, Markus / Ney, Hermann (2012): "The RWTH Aachen speech recognition and machine translation system for IWSLT 2012", In IWSLT-2012, 69-76.