5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

Human vs. Machine Speaker Identification with Telephone Speech

Astrid Schmidt-Nielsen (1), Thomas H. Crystal (2)

(1) U.S. Naval Research Laboratory, USA
(2) IDA Center for Communications Research, USA

An experiment compared the speaker recognition performance of human listeners with that of computer algorithms and systems. Listening protocols were developed to parallel the procedures used in the algorithm evaluation run by the U.S. National Institute of Standards and Technology (NIST), and the same telephone conversation data were used. For "same number" testing with three-second samples, listener panels and the best algorithm had the same equal-error rate (EER) of 8%, and listeners were better than typical algorithms. For "different number" testing, EERs increased, but the human EER was 40% lower than the machine EER. Further observations were made on human listening performance and on robustness to "degradations".
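
For readers unfamiliar with the metric, the equal-error rate is the operating point at which the miss rate (same-speaker trials rejected) equals the false-alarm rate (different-speaker trials accepted). The following is a minimal sketch of one common way to estimate it from trial scores; it is not the scoring procedure used in the paper or by NIST. The function name `equal_error_rate` and the example scores are hypothetical and chosen only for illustration.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Estimate the EER by sweeping a decision threshold over all observed scores.

    target_scores:   scores for same-speaker trials (higher = more similar).
    impostor_scores: scores for different-speaker trials.
    Returns (eer, threshold) at the threshold where the miss rate and the
    false-alarm rate are closest; the EER is reported as their average.
    """
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # A trial is accepted as "same speaker" when its score is >= t.
        miss = sum(s < t for s in target_scores) / len(target_scores)
        false_alarm = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(miss - false_alarm)
        if best is None or gap < best[0]:
            best = (gap, (miss + false_alarm) / 2, t)
    return best[1], best[2]


# Illustrative scores only; these are not data from the experiment.
target = [0.9, 0.8, 0.7, 0.6, 0.3]
impostor = [0.1, 0.2, 0.4, 0.5, 0.65]
eer, threshold = equal_error_rate(target, impostor)
print(f"EER = {eer:.0%} at threshold {threshold}")
```

With only a handful of trials the two error rates may never cross exactly, which is why the sketch averages them at the closest point; larger evaluations typically interpolate along the error curve instead.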


Bibliographic reference. Schmidt-Nielsen, Astrid / Crystal, Thomas H. (1998): "Human vs. machine speaker identification with telephone speech", in ICSLP-1998, paper 0148.