INTERSPEECH 2011
12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task

Minoru Tsuzaki (1), Keiichi Tokuda (2), Hisashi Kawai (3), Jinfu Ni (3)

(1) Kyoto City University of Arts, Japan
(2) Nagoya Institute of Technology, Japan
(3) NICT, Japan

This paper reconfirms that talker identity can be transmitted across languages. Talker discrimination was examined in an ABX paradigm in which stimuli A and B were utterances by different talkers in the same language, and stimulus X was an utterance by one of those two talkers in a different language. The average hit rate in this discrimination task was as high as 0.89. Mutual distance matrices were generated using the discrimination index, d'. By applying multidimensional scaling to these matrices, three-dimensional perceptual spaces were estimated. Features related to loudness and spectral centroid contributed strongly to the perceptual dimensions.
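
As an illustration of the scaling step, the following is a minimal sketch of how a matrix of pairwise talker distances could be embedded in three dimensions. It uses classical (Torgerson) multidimensional scaling, which is an assumption: the abstract does not say which MDS variant was used. The function name `classical_mds` and the four-talker matrix are hypothetical, and the values are made up for illustration, not taken from the paper.

```python
import numpy as np


def classical_mds(dist, n_dims=3):
    """Embed points from a symmetric distance matrix via classical (Torgerson) MDS."""
    dist = np.asarray(dist, dtype=float)
    n = dist.shape[0]
    # Double-center the squared distances to obtain a Gram (inner-product) matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    # Eigen-decompose and keep the n_dims largest components.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_dims]
    # Clip negative eigenvalues, which appear when the distances are not Euclidean.
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Hypothetical discrimination-index values between four talkers
# (illustrative only, not data from the paper).
dissimilarity = np.array([
    [0.0, 2.1, 2.8, 1.5],
    [2.1, 0.0, 1.9, 2.4],
    [2.8, 1.9, 0.0, 2.6],
    [1.5, 2.4, 2.6, 0.0],
])

coords = classical_mds(dissimilarity, n_dims=3)
print(coords.shape)  # one row per talker, one column per perceptual dimension
```

Each row of `coords` is one talker's position in the estimated space. Relating those axes to acoustic features such as loudness or spectral centroid would be a separate step, for example correlating each axis with per-talker feature values.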


Bibliographic reference. Tsuzaki, Minoru / Tokuda, Keiichi / Kawai, Hisashi / Ni, Jinfu (2011): "Estimation of perceptual spaces for speaker identities based on the cross-lingual discrimination task", in INTERSPEECH-2011, 157-160.