In this paper we propose the inversion of nonlinear distortions in order to improve the recognition rates of a speaker recognizer system. We study the effect of saturations on the test signals, trying to take into account real situations where the training material has been recorded in a controlled situation but the testing signals present some mismatch with the input signal level (saturations). The experimental results shows that a combination of several strategies can reduce the minimum detection cost function with saturated test sentences from 6.42% to 4.15%, while the results with clean speech (without saturation) is 5.74% for one microphone and 7.02% for the other one.
Cite as: Faúndez-Zanuy, M., Solé-Casals, J. (2005) Speaker verification improvement using blind inversion of distortions. Proc. Interspeech 2005, 2049-2052, doi: 10.21437/Interspeech.2005-640
@inproceedings{faundezzanuy05_interspeech, author={Marcos Faúndez-Zanuy and Jordi Solé-Casals}, title={{Speaker verification improvement using blind inversion of distortions}}, year=2005, booktitle={Proc. Interspeech 2005}, pages={2049--2052}, doi={10.21437/Interspeech.2005-640} }