Sixth International Conference on Spoken Language Processing
This paper describes a database designed to evaluate the performance of speech recognition algorithms in noisy conditions. The database may either be used to measure frontend feature extraction algorithms, using a defined HMM recognition back-end, or complete recognition systems. The source speech for this database is the TIdigits, consisting of connected digits task spoken by American English talkers (downsampled to 8kHz). A selection of 8 different real-world noises have been added to the speech over a range of signal to noise ratios with controlled filtering of the speech and noise.
The framework was prepared as a contribution to the ETSI STQ-AURORA DSR Working Group. Aurora is developing standards for Distributed Speech Recognition (DSR) where the speech analysis is done in the telecommunication terminal and the recognition at a central location in the telecom network. The framework is currently being used to evaluate alternative proposals for front-end feature extraction. The database has been made publicly available through ELRA so that other speech researchers to evaluate and compare the performance of noise robust algorithms.
Recognition results will be presented for the first standard DSR feature extraction scheme based on a cepstral analysis.
Bibliographic reference. Pearce, David / Hirsch, Hans-GŁnter (2000): "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", In ICSLP-2000, vol.4, 29-32.