Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

Use of Simulated Data for Robust Telephone Speech Recognition

Coianiz Tarcisio, Falavigna Daniele, Gretter Roberto, Orlandi Marco

IRST Istituto per la Ricerca Scientifica e Tecnologica, Panté di Povo, Trento, Italy

The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy database that can be used, in addition to other techniques, for compensating or adapting speech recogniser parameters with respect to different test environments. For the first of the two adopted test sets, performance improvements ranging from about 30% to about 9% have been measured, as a function of the quantity of real telephone data used, in addition to the simulated ones, for system training. For the second test set no significant improvements were obtained.


Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Tarcisio, Coianiz / Daniele, Falavigna / Roberto, Gretter / Marco, Orlandi (1999): "Use of simulated data for robust telephone speech recognition", In EUROSPEECH'99, 2825-2828.