Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Speaker Detection Using Acoustic Event Sequences

Nicolas Scheffer, Jean-François Bonastre

LIA-CNRS, Avignon, France

Novel approaches using high level features have recently shown up in the speaker recognition field. They basically consist in modeling speakers using linguistic features such as words, phonemes, idiolects. The benefit of these features was demonstrated in NIST campaigns. Their main disadvantage is their need of a huge amount of data to be efficient. The purpose of this study is to generalize this approach by using acoustic events, generated by a GMM, as input features. A methodology to build a dictionary and to model speakers using symbol sequences from this dictionary is derived. Different experiments on NIST SRE 2004 database show that the information produced is speaker specific and that a fusion experiment with a GMM verification system improves performance.

Full Paper

Bibliographic reference.  Scheffer, Nicolas / Bonastre, Jean-François (2005): "Speaker detection using acoustic event sequences", In INTERSPEECH-2005, 3065-3068.