Local binary patterns as features for speaker recognition

Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-Francois Bonastre


The i-vector framework has seen great success in speaker recognition (SR) in recent years. Feature extraction is central to SR systems, and many features have been developed over the years to improve recognition performance. In this paper, we present a new feature representation based on Local Binary Patterns (LBP), a concept initially developed in computer vision to characterize textures. We explore the use of LBP as features for speaker recognition and show that using them as descriptors of cepstral coefficient dynamics (replacing Delta and Delta-Delta in the regular MFCC representation) results in more efficient features and yields up to 15% relative improvement over the baseline system in both clean and noisy conditions.
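
To make the idea of an LBP descriptor over cepstral dynamics concrete, here is a minimal sketch of a one-dimensional LBP computed along the time axis of a cepstral matrix. The abstract does not specify the exact formulation used in the paper, so the function name `lbp_1d`, the `radius` parameter, the `>=` comparison, the bit ordering, and the edge padding are all illustrative assumptions rather than the authors' method.

```python
import numpy as np


def lbp_1d(cepstra, radius=2):
    """Hypothetical 1-D LBP over time for each cepstral coefficient.

    cepstra: array of shape (num_frames, num_coeffs), e.g. static MFCCs.
    Returns an integer array of the same shape. Each code has 2 * radius
    bits, one per temporal neighbour of the frame.
    """
    cepstra = np.asarray(cepstra, dtype=float)
    num_frames = cepstra.shape[0]

    # Assumption: repeat the first and last frames so that every frame
    # has a full set of neighbours.
    padded = np.pad(cepstra, ((radius, radius), (0, 0)), mode="edge")

    # Neighbour offsets in time, skipping the centre frame itself.
    offsets = [k for k in range(-radius, radius + 1) if k != 0]

    codes = np.zeros(cepstra.shape, dtype=np.int64)
    for bit, k in enumerate(offsets):
        neighbour = padded[radius + k : radius + k + num_frames]
        # Set this bit when the neighbour is at least as large as the centre.
        codes |= (neighbour >= cepstra).astype(np.int64) << bit
    return codes
```

Under these assumptions, the resulting codes could be stacked next to the static coefficients, for example with `np.hstack([static, lbp_1d(static)])`, in place of the Delta and Delta-Delta columns.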


DOI: 10.21437/Odyssey.2016-50

Cite as

Kheder, W.B., Matrouf, D., Ajili, M., Bonastre, J. (2016) Local binary patterns as features for speaker recognition. Proc. Odyssey 2016, 346-351.

BibTeX
@inproceedings{Kheder+2016,
author={Waad Ben Kheder and Driss Matrouf and Moez Ajili and Jean-Francois Bonastre},
title={Local binary patterns as features for speaker recognition},
year=2016,
booktitle={Odyssey 2016},
doi={10.21437/Odyssey.2016-50},
url={http://dx.doi.org/10.21437/Odyssey.2016-50},
pages={346--351}
}