8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Speech Recognition Error Analysis on the English MALACH Corpus

Olivier Siohan, Bhuvana Ramabhadran, Geoffrey Zweig

IBM T.J. Watson Research Center, USA

This paper presents an analysis of the word recognition error rate on an English subset of the MALACH corpus. The MALACH project is an NSF-funded research program related to the development of multilingual access to large audio archives. The archive is a large collection of testimonies from survivors of the Nazi Holocaust. This data has some unique characteristics such as elderly speech, noisy conditions, heavily accented speech. Hence, it is a challenging task for automatic speech recognition (ASR). This paper attempts to identify the factors affecting the ASR performance. It was found that the signal-to-noise ratio and syllable rate were two dominant factors in explaining the overall word error rate, while we observed no evidence of the impact of accent and speaker's age on the recognition performance. Based on this evidence, noise compensation experiments were carried out and led to a 1.1% absolute reduction of the word error rate.

Full Paper

Bibliographic reference.  Siohan, Olivier / Ramabhadran, Bhuvana / Zweig, Geoffrey (2004): "Speech recognition error analysis on the English MALACH corpus", In INTERSPEECH-2004, 413-416.