The project Technology and Corpora for Speech to Speech Translation (TC-STAR) aims at making a break-through in speech-to-speech translation research, significantly reducing the gap between the performance of machines and humans at this task. Technological and scientific progress is driven by periodic, competitive evaluations within the project. In this paper we describe the ISL speech transcription system for English European Parliament speeches with which we participated in the third TC-STAR evaluation campaign in the spring of 2007. The improvements over last year's system originate from a recognition hypotheses based segmentation, the utilization of unsupervised in-domain training material, a modified cross-system adaptation and combination scheme, and the enhancement of the language model through the use of web based training material.
Cite as: Stüker, S., Fügen, C., Kraft, F., Wölfel, M. (2007) The ISL 2007 English speech transcription system for european parliament speeches. Proc. Interspeech 2007, 2609-2612, doi: 10.21437/Interspeech.2007-588
@inproceedings{stuker07_interspeech, author={Sebastian Stüker and Christian Fügen and Florian Kraft and Matthias Wölfel}, title={{The ISL 2007 English speech transcription system for european parliament speeches}}, year=2007, booktitle={Proc. Interspeech 2007}, pages={2609--2612}, doi={10.21437/Interspeech.2007-588} }