ESCA Workshop on Automatic Speaker Recognition, Identification, and Verification

Martigny, Switzerland
April 7-9, 1994

Confidence Analysis for Speaker Identification: The Effectiveness of Various Features

Sherman Ong, Miles P. Moody, Sridha Sridharan

Signal Processing Research Centre, Queensland University of Technology, Brisbane, Australia

"Confidence analysis" was added to a speaker identification procedure to enable us to get the confidence that the target is indeed present in or absent from the reference list. The degree of confidence was deduced from an analysis of the differences between two groups of curves, namely AvD (Accuracy versos Distance) and PAvD (Pseudo Accuracy versus Distance). AvD was calculated from the percentage of cumulative matches versus intraspeaker distance. PAvD was obtained the same way with the exception that a speaker's test vectors were not compared against his personal reference template. Instead, another candidate with the most pseudo matches was selected to calculate PAvD.

In this paper, we investigate the effectiveness of four sets of acoustic features (autocorrelation coefficients, reflection coefficients, cepstral coefficients, and log area ratio coefficients) applied to the confidence analysis. Experiments reveal that: (1) autocorrelation coefficients set is not effective, (2) reflection coefficients set generally excels against any other feature sets, and (3) for some specific speakers, log area ratio coefficients set and cepstral coefficients set demonstrate their respective superiorities.

Full Paper

Bibliographic reference.  Ong, Sherman / Moody, Miles P. / Sridharan, Sridha (1994): "Confidence analysis for speaker identification: the effectiveness of various features", In ASRIV-1994, 91-94.