International Workshop on Spoken Language Translation (IWSLT) 2012
In this paper, the automatic speech recognition (ASR) and statistical machine translation (SMT) systems of RWTH Aachen University developed for the evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2012 are presented. We participated in the ASR (English), MT (English-French, Arabic-English, Chinese- English, German-English) and SLT (English-French) tracks. For the MT track both hierarchical and phrase-based SMT decoders are applied. A number of different techniques are evaluated in the MT and SLT tracks, including domain adaptation via data selection, translation model interpolation, phrase training for hierarchical and phrase-based systems, additional reordering model, word class language model, various Arabic and Chinese segmentation methods, postprocessing of speech recognition output with an SMT system, and system combination. By application of these methods we can show considerable improvements over the respective baseline systems.
Full Paper Presentation
Bibliographic reference. Peitz, Stephan / Mansour, Saab / Freitag, Markus / Feng, Minwei / Huck, Matthias / Wuebker, Joern / Nuhn, Malte / Nußbaum-Thom, Markus / Ney, Hermann (2012): "The RWTH Aachen speech recognition and machine translation system for IWSLT 2012", In IWSLT-2012, 69-76.