Interspeech'2005 - Eurospeech
In this paper we propose inverting nonlinear distortions in order to improve the recognition rates of a speaker recognition system. We study the effect of saturation on the test signals, to model realistic situations in which the training material has been recorded under controlled conditions but the test signals are mismatched in input signal level (saturation). The experimental results show that a combination of several strategies can reduce the minimum detection cost function with saturated test sentences from 6.42% to 4.15%, while the results with clean speech (without saturation) are 5.74% for one microphone and 7.02% for the other.
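As a rough illustration of the kind of level mismatch described above, the following sketch simulates saturation by hard-clipping a synthetic signal. The hard-clipping model, the function names `saturate` and `clipped_fraction`, the sine-wave test signal, and the threshold of 0.5 are all assumptions made for illustration; the abstract does not specify how the saturated test signals were produced.

```python
import math


def saturate(samples, limit):
    """Hard-clip each sample to [-limit, +limit] (assumed saturation model)."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return [max(-limit, min(limit, s)) for s in samples]


def clipped_fraction(samples, limit):
    """Return the fraction of samples whose magnitude reaches the limit."""
    if not samples:
        return 0.0
    return sum(1 for s in samples if abs(s) >= limit) / len(samples)


# Synthetic stand-in for a speech frame: a 200 Hz sine sampled at 8 kHz.
sample_rate = 8000
clean = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(400)]

# Hypothetical saturation threshold at half of full scale.
saturated = saturate(clean, limit=0.5)
```

Here `clipped_fraction(clean, 0.5)` gives a simple measure of how severely a signal would be distorted at a given threshold, which could be used to choose test conditions of varying severity.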
Bibliographic reference. Faúndez-Zanuy, Marcos / Solé-Casals, Jordi (2005): "Speaker verification improvement using blind inversion of distortions", In INTERSPEECH-2005, 2049-2052.