Defining the relevant population to sample is an important issue in data-based implementation of the likelihood-ratio framework for forensic voice comparison. We present a logical argument that because an investigator or prosecutor only submits suspect and offender recordings for forensic analysis if they sound sufficiently similar to each other, the appropriate defense hypothesis for the forensic scientist to adopt will usually be that the suspect is not the speaker on the offender recording but is a member of a population of speakers who sound sufficiently similar that an investigator or prosecutor would submit recordings of these speakers for forensic analysis. We propose a procedure for selecting background, development, and test databases using a panel of human listeners, and empirically test an automatic procedure inspired by the above. Although the automatic procedure is not entirely consistent with the logical arguments and human-listener procedure, it serves as a proof of concept for the importance of database selection. A forensicvoice- comparison system using the automatic database-selection procedure outperformed systems with random database selection.
Cite as: Morrison, G.S., Ochoa, F., Thiruvaran, T. (2012) Database selection for forensic voice comparison. Proc. The Speaker and Language Recognition Workshop (Odyssey 2012), 62-77
@inproceedings{morrison12_odyssey, author={Geoffrey Stewart Morrison and Felipe Ochoa and Tharmarajah Thiruvaran}, title={{Database selection for forensic voice comparison}}, year=2012, booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2012)}, pages={62--77} }