This paper describes our English Speech-to-Text (STT) system for the 2011 IWSLT ASR track. The system consists of 2 subsystems with different front-ends - one MVDR based, one MFCC based - which are combined using confusion network combination to provide a base for a second pass speaker adapted MVDR system. We demonstrate that this set-up produces competitive results on the IWSLT 2010 dev and test sets.
Cite as: Stüker, S., Kilgour, K., Saam, C., Waibel, A. (2011) The 2011 KIT English ASR system for the IWSLT evaluation. Proc. International Workshop on Spoken Language Translation (IWSLT 2011), 94-97
@inproceedings{stuker11_iwslt, author={Sebastian Stüker and Kevin Kilgour and Christian Saam and Alex Waibel}, title={{The 2011 KIT English ASR system for the IWSLT evaluation}}, year=2011, booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2011)}, pages={94--97} }