Sixth European Conference on Speech Communication and Technology
The collection of telephone databases, for training speech recognisers, is a time consuming and costly work. In the paper we propose a method for producing simulated telephone data starting from clean wide band databases. The result of the simulation is the generation of a noisy database that can be used, in addition to other techniques, for compensating or adapting speech recogniser parameters with respect to different test environments. For the first of the two adopted test sets, performance improvements ranging from about 30% to about 9% have been measured, as a function of the quantity of real telephone data used, in addition to the simulated ones, for system training. For the second test set no significant improvements were obtained.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Tarcisio, Coianiz / Daniele, Falavigna / Roberto, Gretter / Marco, Orlandi (1999): "Use of simulated data for robust telephone speech recognition", In EUROSPEECH'99, 2825-2828.