ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Integration of TDOA features in information bottleneck framework for fast speaker diarization

Deepu Vijayasenan, Fabio Valente, Hervé Bourlard

In this paper we address the combination of multiple feature streams in a fast speaker diarization system for meeting recordings. Whenever Multiple Distant Microphones (MDM) are used, it is possible to estimate the Time Delay of Arrival (TDOA) for different channels. In [1], it is shown that TDOA can be used as additional features together with conventional spectral features for improving speaker diarization. We investigate here the combination of TDOA and spectral features in a fast diarization system based on the Information Bottleneck principle. We evaluate the algorithm on the NIST RT06 diarization task. Adding TDOA features to spectral features reduces the speaker error by 7% absolute. Results are comparable to those of conventional HMM/GMM based systems with consistent reduction in computational complexity.


doi: 10.21437/Interspeech.2008-8

Cite as: Vijayasenan, D., Valente, F., Bourlard, H. (2008) Integration of TDOA features in information bottleneck framework for fast speaker diarization. Proc. Interspeech 2008, 40-43, doi: 10.21437/Interspeech.2008-8

@inproceedings{vijayasenan08_interspeech,
  author={Deepu Vijayasenan and Fabio Valente and Hervé Bourlard},
  title={{Integration of TDOA features in information bottleneck framework for fast speaker diarization}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={40--43},
  doi={10.21437/Interspeech.2008-8}
}