The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms

Víctor Poblete, Juan Pablo Escudero, Josué Fredes, José Novoa, Richard M. Stern, Simon King, Néstor Becerra Yoma


We describe the ability of LNCC features (Locally Normalized Cepstral Coefficients) to improve speaker recognition accuracy in highly reverberant environments. We used a realistic test environment, in which we changed the number and nature of reflective surfaces in the room, creating four increasingly reverberant times from approximately 1 to 9 seconds. In this room, we re-recorded reverberated versions of the Yoho speaker verification corpus. The recordings were made using four speaker-to-microphone distances, from 0.32m to 2.56m. Experimental results for a speaker verification task suggest that LNCC features are an attractive alternative to MFCC features under such reverberant conditions, as they were observed to improve verification accuracy compared to baseline MFCC features in all cases where the reverberation time exceeded 1 second or with a greater speaker-microphone distance (i.e. 2.56 m).


DOI: 10.21437/Interspeech.2016-1277

Cite as

Poblete, V., Escudero, J.P., Fredes, J., Novoa, J., Stern, R.M., King, S., Yoma, N.B. (2016) The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms. Proc. Interspeech 2016, 2373-2377.

Bibtex
@inproceedings{Poblete+2016,
author={Víctor Poblete and Juan Pablo Escudero and Josué Fredes and José Novoa and Richard M. Stern and Simon King and Néstor Becerra Yoma},
title={The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1277},
url={http://dx.doi.org/10.21437/Interspeech.2016-1277},
pages={2373--2377}
}