SAPA-SCALE Conference 2012

Portland, OR, USA
September 7-8, 2012

Dimensionality Reduction of Large TDOA Vectors for Speaker Diarization

Deepu Vijayasenan (1), Fabio Valente (2)

(1) Universit¨at des Saarlandes, Saarbrücken, Germany
(2) Idiap Research Institue, Martigny, Switzerland

In this work, we investigate a dimensionality reduction scheme to use Time Delay of Arrival(TDOA) features across all microphones in a traditional HMM/GMM system. The subspace dimension is selected based on dimension of the TDOA vectors in an ideal recording, i.e., without environmental distortion or interference. Experiments in a dataset used in NIST Meeting Diarization evaluation reveal that the dimensionality reduction to a considerably lower dimension improve the diarization error by 3.7%(30% relative). While the proposed scheme has the advantage that it does not require any development set tuning to select the dimension as proposed by previous methods, it retains competitive performance (5% better than tuning the results).

Index Terms: Speaker diarization, Time Delay of Arrival, Dimensionality reduction

Full Paper

Bibliographic reference.  Vijayasenan, Deepu / Valente, Fabio (2012): "Dimensionality reduction of large TDOA vectors for speaker diarization", In SAPA-SCALE-2012, 64-67.