ODYSSEY 2004 - The Speaker and Language Recognition Workshop

May 31 - June 3, 2004
Toledo, Spain

Relative Effectiveness of Score Normalisation Methods in Open-Set Speaker Identification

J. Fortuna (1), P. Sivakumaran (2), A. M. Ariyaeeinia (1), A. Malegaonkar (1)

(1) University of Hertfordshire, UK
(2) Canon Research Centre Europe Ltd, Bracknell, Berkshire, UK

This paper presents an investigation into the relative effectiveness of various well-known score normalisation methods in the context of open-set, text-independent speaker identification. The scope of the study includes a thorough experimental analysis of the performance of the methods considered. The experimental investigations are based on the use of the dataset proposed for the 1-speaker detection task of the NIST Speaker Recognition Evaluation 2003. The results clearly demonstrate that significant benefits can be achieved by using score normalisation in open-set identification, and that the level of this depends highly on the type of the approach adopted. Based on the experimental results, it is found that amongst the various normalisation methods considered, those which are based on the Bayesian solution provide the best performance. In particular, the unconstrained cohort method with a small cohort size appears to outperform all other approaches. The paper provides a detailed description of the experimental set up, and presents an analysis of the results obtained.

Full Paper

Bibliographic reference.  Fortuna, J. / Sivakumaran, P. / Ariyaeeinia, A. M. / Malegaonkar, A. (2004): "Relative effectiveness of score normalisation methods in open-set speaker identification", In ODYS-2004, 369-376.