9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

T-Test Distance and Clustering Criterion for Speaker Diarization

Trung Hieu Nguyen (1), Eng Siong Chng (2), Haizhou Li (1)

(1) Institute for Infocomm Research, Singapore; (2) Nanyang Technological University, Singapore

In this paper, we present an application of student's t-test to measure the similarity between two speaker models. The measure is evaluated by comparing with other distance metrics: the Generalized Likelihood Ratio, the Cross Likelihood Ratio and the Normalized Cross Likelihood Ratio in speaker detection task. We also propose an objective criterion for speaker clustering. The criterion deduces the number of speakers automatically by maximizing the separation between intra-speaker distances and inter-speaker distances. It requires no development data and works well with various distance metrics. We then report the performance of our proposed similarity distance measure and objective criterion in speaker diarization task. The system produces competitive results: low speaker diarization error rate and high accuracy in detecting number of speakers.

Full Paper

Bibliographic reference.  Nguyen, Trung Hieu / Chng, Eng Siong / Li, Haizhou (2008): "T-test distance and clustering criterion for speaker diarization", In INTERSPEECH-2008, 36-39.