INTERSPEECH 2009
10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Speaker Recognition by Gaussian Information Bottleneck

Ron M. Hecht (1), Elad Noor (2), Naftali Tishby (3)

(1) Tel-Aviv University, Israel
(2) Weizmann Institute of Science, Israel
(3) Hebrew University, Israel

This paper explores a novel approach for the extraction of relevant information in speaker recognition tasks. This approach uses a principled information theoretic framework the Information Bottleneck method (IB). In our application, the method compresses the acoustic data while preserving mostly the relevant information for speaker identification. This paper focuses on a continuous version of the IB method known as the Gaussian Information Bottleneck (GIB). This version assumes that both the source and target variables are high dimensional multivariate Gaussian variables. The GIB was applied in our work to the Super Vector (SV) dimension reduction conundrum. Experiments were conducted on the male part of the NIST SRE 2005 corpora. The GIB representation was compared to other dimension reduction techniques and to a baseline system. In our experiments, the GIB outperformed the baseline system; achieving a 6.1% Equal Error Rate (EER) compared to the 15.1% EER of a baseline system.

Full Paper

Bibliographic reference.  Hecht, Ron M. / Noor, Elad / Tishby, Naftali (2009): "Speaker recognition by Gaussian information bottleneck", In INTERSPEECH-2009, 1567-1570.