EUROSPEECH 2003 - INTERSPEECH 2003
This paper describes experiments carried out to automatically transcribe football commentaries in Dutch, English and German for multimedia indexing. Our results show that the high levels of stadium noise in the material create a task that is extremely difficult for conventional ASR. The baseline WERs vary from 83% to 94% for the three languages investigated. Employing state-of-the-art noise robustness techniques leads to relative reductions of 9-10% WER. Application specific words such as players' names are recognized correctly in about 50% of cases. Although this result is substantially better than the overall result, it is inadequate. Much better results can be obtained if the football commentaries are recorded separately from the stadium noise. This would make the automatic transcriptions more useful for multimedia indexing.
Bibliographic reference. Sturm, Janienke / Kessens, Judith M. / Wester, Mirjam / Wet, Febe de / Sanders, Eric / Strik, Helmer (2003): "Automatic transcription of football commentaries in the MUMIS project", In EUROSPEECH-2003, 1853-1856.