SAPA-SCALE Conference 2012
Portland, OR, USA
In this work, we investigate a dimensionality reduction scheme to use Time Delay of Arrival(TDOA) features across all microphones in a traditional HMM/GMM system. The subspace dimension is selected based on dimension of the TDOA vectors in an ideal recording, i.e., without environmental distortion or interference. Experiments in a dataset used in NIST Meeting Diarization evaluation reveal that the dimensionality reduction to a considerably lower dimension improve the diarization error by 3.7%(30% relative). While the proposed scheme has the advantage that it does not require any development set tuning to select the dimension as proposed by previous methods, it retains competitive performance (5% better than tuning the results).
Index Terms: Speaker diarization, Time Delay of Arrival, Dimensionality reduction
Bibliographic reference. Vijayasenan, Deepu / Valente, Fabio (2012): "Dimensionality reduction of large TDOA vectors for speaker diarization", In SAPA-SCALE-2012, 64-67.