9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Forensic Speaker Recognition in Chinese: A Multivariate Likelihood Ratio Discrimination on /i/ and /y/

Cuiling Zhang (1), Geoffrey Stewart Morrison (2), Philip Rose (2)

(1) China Criminal Police University, China; (2) Australian National University, Australia

A likelihood-ratio-based forensic speaker discrimination was conducted using the mean formant frequencies of Standard Chinese /i/ and /y/ tokens produced by 64 male speakers. The speech data were relatively forensically realistic in that they were relatively extemporaneous, were recorded over the telephone, and were from three non-contemporaneous recording sessions. A multivariate-kernel-density formula was used to calculate crossvalidated likelihood ratios comparing all possible same-speaker and different-speaker combinations across sessions. Results were comparable with those previously obtained with laboratory speech in other languages. In general, greater strength of evidence was obtained for recording sessions separated by one week than for recording sessions separated by one month.

Full Paper

Bibliographic reference.  Zhang, Cuiling / Morrison, Geoffrey Stewart / Rose, Philip (2008): "Forensic speaker recognition in Chinese: a multivariate likelihood ratio discrimination on /i/ and /y/", In INTERSPEECH-2008, 1937-1940.