ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition

Mireia Diez, Amparo Varona, Mikel Penagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel

In a previous work, we introduced the use of log-likelihood ratios of phone posterior probabilities, called Phone Log-Likelihood Ratios (PLLR) as features for language recognition under an iVector-based approach, yielding high performance and promising results. However, the high dimensionality of the PLLR feature vectors (with regard to MFCC/SDC features) results in comparatively higher computational costs. In this work, several supervised and unsupervised dimensionality reduction techniques are studied, based on either fusions or selection of phone posteriors, finding that PLLR feature vectors can be reduced to almost a third of their original size attaining similar performance. Finally, Principal Component Analysis (PCA) is also applied to the original PLLR vector as a feature projection method for comparison purposes. Results show that PCA stands out among all the techniques studied, revealing that it does not only reduce computational costs, but also improves system performance significantly.


doi: 10.21437/Interspeech.2013-39

Cite as: Diez, M., Varona, A., Penagarikano, M., Rodríguez-Fuentes, L.J., Bordel, G. (2013) Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition. Proc. Interspeech 2013, 64-68, doi: 10.21437/Interspeech.2013-39

@inproceedings{diez13_interspeech,
  author={Mireia Diez and Amparo Varona and Mikel Penagarikano and Luis Javier Rodríguez-Fuentes and Germán Bordel},
  title={{Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={64--68},
  doi={10.21437/Interspeech.2013-39}
}