EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Score Normalisation Applied to Open-Set, Text-Independent Speaker Identification

P. Sivakumaran (1), J. Fortuna (2), Aladdin M. Ariyaeeinia (2)

(1) 20/20 Speech Ltd., U.K.
(2) University of Hertfordshire, U.K.

This paper presents an investigation into the relative effectiveness of various score normalisation methods for open-set, text-independent speaker identification. The paper describes the need for score normalisation in this case, and provides a detailed theoretical and experimental analysis of the methods that can be used for this purpose. The experimental investigations are based on the use of speech material drawn from 9 hours of recordings of different Broadcast News. The results clearly demonstrate the significance of improvement offered by score normalisation. It is shown that, amongst various normalisation methods considered, the unconstrained cohort normalisation method achieves the best performance in terms of reducing the errors associated with the open-set nature of the process. Furthermore, it is demonstrated that both the cohort and world model methods can offer very similar effectiveness, and also outperform the T-norm method in this particular case of speaker recognition.

Full Paper

Bibliographic reference.  Sivakumaran, P. / Fortuna, J. / Ariyaeeinia, Aladdin M. (2003): "Score normalisation applied to open-set, text-independent speaker identification", In EUROSPEECH-2003, 2669-2672.