Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

dPLRM-Based Speaker Identification with Log Power Spectrum

Tomoko Matsui, Kunio Tanabe

Institute of Statistical Mathematics, Japan

This paper investigates speaker identification with implicit extraction of speaker characteristics relevant to discrimination from the log power spectrum of training speech by employing the inductive power of dual Penalized Logistic Regression Machine (dPLRM). The dPLRM is one of kernel methods like the support vector machine (SVM) and has the inductive power due to the mechanism of the kernel regression. In text-independent speaker identification experiments with training speech uttered by 10 male speakers in three different sessions, we compares the performances of dPLRM, SVM and Gaussian mixture model (GMM)-based methods and show that dPLRM implicitly and effectively extracts speaker characteristics from the log power spectrum. It is also shown that dPLRM outperforms the other methods especially when the amount of training data is small.

Full Paper

Bibliographic reference.  Matsui, Tomoko / Tanabe, Kunio (2005): "dPLRM-based speaker identification with log power spectrum", In INTERSPEECH-2005, 2017-2020.