11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

A Novel Speaker Binary Key Derived from Anchor Models

Xavier Anguera (1), Jean-François Bonastre (2)

(1) Telefónica I+D, Spain
(2) LIA, France

The approach presented in this paper represents voice recordings by a novel acoustic key composed only of binary values. Except for the process being used to extract such keys, there is no need for acoustic modeling and processing in the approach proposed, as all the other elements in the system are based on the binary vectors. We show that this binary key is able to effectively model a speaker's voice and to distinguish it from other speakers. Its main properties are its small size compared to current speaker modeling techniques and its low computational cost when comparing different speakers as it is limited to obtaining a similarity metric between two binary vectors. Furthermore, the binary key vector extraction process does not need any threshold and offers the opportunity to set the decision steps in a well defined binary domain where scores and decisions are easy to interpret and implement.

Full Paper

Bibliographic reference.  Anguera, Xavier / Bonastre, Jean-François (2010): "A novel speaker binary key derived from anchor models", In INTERSPEECH-2010, 2118-2121.