Odyssey 2012 - The Speaker and Language Recognition Workshop

June 25-28, 2012

Database Selection for Forensic Voice Comparison

Geoffrey Stewart Morrison, Felipe Ochoa, Tharmarajah Thiruvaran

Forensic Voice Comparison Laboratory, School of Electrical Engineering & Telecommunications, University of New South Wales, Australia

Defining the relevant population to sample is an important issue in data-based implementation of the likelihood-ratio framework for forensic voice comparison. We present a logical argument that because an investigator or prosecutor only submits suspect and offender recordings for forensic analysis if they sound sufficiently similar to each other, the appropriate defense hypothesis for the forensic scientist to adopt will usually be that the suspect is not the speaker on the offender recording but is a member of a population of speakers who sound sufficiently similar that an investigator or prosecutor would submit recordings of these speakers for forensic analysis. We propose a procedure for selecting background, development, and test databases using a panel of human listeners, and empirically test an automatic procedure inspired by the above. Although the automatic procedure is not entirely consistent with the logical arguments and human-listener procedure, it serves as a proof of concept for the importance of database selection. A forensicvoice- comparison system using the automatic database-selection procedure outperformed systems with random database selection.

Full Paper

Bibliographic reference.  Morrison, Geoffrey Stewart / Ochoa, Felipe / Thiruvaran, Tharmarajah (2012): "Database selection for forensic voice comparison", In Odyssey-2012, 62-77.