Third European Conference on Speech Communication and Technology

Berlin, Germany
September 22-25, 1993


Measuring Similarities Among Speakers by Means of Neural Networks

J. A. Hernandez-Mendez, Anibal R. Figueiras-Vidal

GPSS/DSSR, ETSI Telecom-UPM, Ciudad Universitaria, Madrid, Spain

We compare several neural networks architectures to measure the degree of similarity among speakers. For each speaker of a reference set, Multilayer Perceptrons and Radial Basis Functions are trained to perform a non-linear principal component analysis of acoustic vectors, and Self-Organized Feature Maps are used to construct Vector Quantizers. As a first simple step, we use non-discriminant training to characterize speakers, and, then, the result is applied to combine speaker-dependent speech recognition models. In a second phase, discriminant training over speaker models is carried out, and speaker verification and identification performances of these networks are evaluated.

Keywords: Speech recognition, speaker recognition, neural networks, similarity measures, models.

Full Paper

Bibliographic reference.  Hernandez-Mendez, J. A. / Figueiras-Vidal, Anibal R. (1993): "Measuring similarities among speakers by means of neural networks", In EUROSPEECH'93, 643-646.