EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Auditory-Instrumental Forensic Speaker Recognition

Stefan Gfroerer

Bundeskriminalamt, Germany

The most prominent part in forensic speech and audio processing is speaker recognition. In the world a number of approaches to forensic speaker recognition (FSR) have been developed, that are different in terms of technical procedures, methodology, instrumentation and also in terms of the probability scale on which the final conclusion is based. The BKA's approach to speaker recognition is a combination of classical phonetic analysis techniques including analytical listening by an expert and the use of signal processing techniques within an acoustic-phonetic framework. This combined auditory-instrumental method includes acoustic measurements of parameters which may be interpreted using statistical information on their distributions, e.g. probability distributions of average fundamental frequency for adult males and females, average syllable rates as indicators of speech rate, etc. In a voice comparison report the final conclusion is determined by a synopsis of the results from auditory and acoustic parameters, amounting to about eight to twelve on average, depending on the nature of the speech material. Results are given in the form of probability statements. The paper gives an overview of current procedures and specific problems of FSR.

Full Paper

Bibliographic reference.  Gfroerer, Stefan (2003): "Auditory-instrumental forensic speaker recognition", In EUROSPEECH-2003, 705-708.