14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Dimensionality Reduction of Phone Log-Likelihood Ratio Features for Spoken Language Recognition

Mireia Diez, Amparo Varona, Mikel Penagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel

Universidad del País Vasco, Spain

In a previous work, we introduced the use of log-likelihood ratios of phone posterior probabilities, called Phone Log-Likelihood Ratios (PLLR) as features for language recognition under an iVector-based approach, yielding high performance and promising results. However, the high dimensionality of the PLLR feature vectors (with regard to MFCC/SDC features) results in comparatively higher computational costs. In this work, several supervised and unsupervised dimensionality reduction techniques are studied, based on either fusions or selection of phone posteriors, finding that PLLR feature vectors can be reduced to almost a third of their original size attaining similar performance. Finally, Principal Component Analysis (PCA) is also applied to the original PLLR vector as a feature projection method for comparison purposes. Results show that PCA stands out among all the techniques studied, revealing that it does not only reduce computational costs, but also improves system performance significantly.

Full Paper

Bibliographic reference.  Diez, Mireia / Varona, Amparo / Penagarikano, Mikel / Rodríguez-Fuentes, Luis Javier / Bordel, Germán (2013): "Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition", In INTERSPEECH-2013, 64-68.