In human-mediated translation scenarios a human interpreter translates between a source and a target language using either a spoken or a written representation of the source language. In this paper we improve the recognition performance on the speech of the human translator spoken in the target language by taking advantage of the source language representations. We use machine translation techniques to translate between the source and target language resources and then bias the target language speech recognizer towards the gained knowledge, hence the name Machine Translation Enhanced Automatic Speech Recognition. We investigate several different techniques among which are restricting the search vocabulary, selecting hypotheses from n-best lists, applying cache and interpolation schemes to language modeling, and combining the most successful techniques into our final, iterative system. Overall we outperform the baseline system by a relative word error rate reduction of 37.6%.
Cite as: Paulik, M., Fügen, C., Stüker, S., Schultz, T., Schaaf, T., Waibel, A. (2005) Document driven machine translation enhanced ASR. Proc. Interspeech 2005, 2261-2264, doi: 10.21437/Interspeech.2005-720
@inproceedings{paulik05_interspeech, author={M. Paulik and Christian Fügen and Sebastian Stüker and Tanja Schultz and Thomas Schaaf and Alex Waibel}, title={{Document driven machine translation enhanced ASR}}, year=2005, booktitle={Proc. Interspeech 2005}, pages={2261--2264}, doi={10.21437/Interspeech.2005-720} }