A Feature Normalisation Technique for PLLR Based Language Identification Systems

Sarith Fernando, Vidhyasaharan Sethu, Eliathamby Ambikairajah


Phone log-likelihood ratio (PLLR) features have been shown to be effective in language identification systems. However, PLLR feature distributions are bounded, which may contradict assumptions of Gaussianity and consequently lead to reduced language recognition rates. In this paper, we propose a feature normalisation technique for the PLLR feature space and demonstrate that it can outperform conventional normalisation and decorrelation techniques such as mean-variance normalisation, feature warping, discrete cosine transform and principal component analysis (PCA). Experimental results on the NIST LRE 2007 and the NIST LRE 2015 databases show that the proposed method outperforms other normalisation methods by at least 9.3% in terms of %Cavg. Finally, unlike PCA, which needs to be estimated from all the training data, the proposed technique can be applied to each utterance independently.
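
For orientation, the following is a minimal sketch of how PLLR features are commonly derived from frame-level phone posteriors, followed by per-utterance mean-variance normalisation, one of the conventional baselines named above. It is not the technique proposed in the paper, which the abstract does not specify. The PLLR formula used here (log of the posterior over the average of the remaining posterior mass) is assumed from the commonly used definition, and the function names `pllr` and `mvn_per_utterance`, the `eps` floor, and the synthetic posteriors are all hypothetical and chosen for illustration only.

```python
import numpy as np


def pllr(posteriors, eps=1e-10):
    """Map phone posteriors (frames x phones) to PLLR features.

    Assumed definition: PLLR_i = log(p_i / ((1 - p_i) / (N - 1))),
    where N is the number of phone classes.
    """
    # Floor and ceil the posteriors so the logarithms stay finite.
    p = np.clip(posteriors, eps, 1.0 - eps)
    n_phones = p.shape[1]
    return np.log(p) - np.log((1.0 - p) / (n_phones - 1))


def mvn_per_utterance(features, eps=1e-10):
    """Mean-variance normalise each feature dimension over one utterance."""
    mean = features.mean(axis=0, keepdims=True)
    std = features.std(axis=0, keepdims=True)
    return (features - mean) / (std + eps)


# Synthetic posteriors for one utterance: 200 frames, 5 phone classes.
rng = np.random.default_rng(0)
logits = rng.normal(size=(200, 5))
posteriors = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)

feats = mvn_per_utterance(pllr(posteriors))
```

Because the statistics are computed from the utterance itself, this baseline shares the per-utterance property that the abstract attributes to the proposed technique, whereas a PCA projection would first have to be estimated on the training set.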


DOI: 10.21437/Interspeech.2016-560

Cite as

Fernando, S., Sethu, V., Ambikairajah, E. (2016) A Feature Normalisation Technique for PLLR Based Language Identification Systems. Proc. Interspeech 2016, 2925-2929.

Bibtex
@inproceedings{Fernando+2016,
author={Sarith Fernando and Vidhyasaharan Sethu and Eliathamby Ambikairajah},
title={A Feature Normalisation Technique for PLLR Based Language Identification Systems},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-560},
url={http://dx.doi.org/10.21437/Interspeech.2016-560},
pages={2925--2929}
}