Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

The Correspondences Between the Perception of the Speaker Individualities Contained in Speech Sounds and Their Acoustic Properties

Kanae Amino, Tsutomu Sugawara, Takayuki Arai

Sophia University, Japan

This study investigates how differences among phones in human speaker identification correspond to the acoustic properties of those phones. In the speaker identification test, Japanese CV syllables excerpted from carrier sentences were used as stimuli. As previous studies have pointed out, stimuli containing nasal sounds were significantly more effective for identifying speakers than stimuli containing only oral sounds. In the acoustic analyses, we examined the spectral properties of the stimuli in order to explain these differences in the perception test, and found that the cepstral distances among speakers were significantly larger for nasal sounds than for oral sounds. The rankings of the consonants in the identification test also showed correspondences with their rankings by cepstral distance.
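
The abstract does not state how the cepstral distances were computed, so the following is only a minimal sketch of one common formulation: the real cepstrum of a Hamming-windowed frame, the Euclidean distance between two low-order cepstral vectors, and the mean of that distance over all speaker pairs. The function names, the number of coefficients (12), the exclusion of the zeroth coefficient, and the synthetic frames are all assumptions made for illustration and are not taken from the paper.

```python
import cmath
import itertools
import math


def real_cepstrum(frame, n_coeffs=12):
    """Low-order real cepstrum of one frame (assumed formulation, not the paper's)."""
    n = len(frame)
    # Hamming window to reduce spectral leakage.
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    # Log magnitude spectrum via a direct DFT (adequate for short frames).
    log_spec = []
    for k in range(n):
        s = sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))
        log_spec.append(math.log(abs(s) + 1e-10))
    # The log magnitude spectrum of a real signal is symmetric, so its
    # inverse DFT is real and reduces to a cosine sum. Coefficient 0
    # (overall level) is skipped.
    return [sum(log_spec[k] * math.cos(2 * math.pi * k * q / n)
                for k in range(n)) / n
            for q in range(1, n_coeffs + 1)]


def cepstral_distance(c_a, c_b):
    """Euclidean distance between two cepstral vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(c_a, c_b)))


def mean_inter_speaker_distance(cepstra):
    """Mean cepstral distance over all unordered pairs of speakers."""
    pairs = list(itertools.combinations(cepstra, 2))
    return sum(cepstral_distance(a, b) for a, b in pairs) / len(pairs)


# Synthetic stand-ins for one frame of the same syllable from three speakers.
N = 64
frames = [
    [math.sin(2 * math.pi * 5 * t / N) for t in range(N)],
    [math.sin(2 * math.pi * 9 * t / N) + 0.5 * math.sin(2 * math.pi * 3 * t / N)
     for t in range(N)],
    [math.exp(-t / 10.0) * math.cos(2 * math.pi * 7 * t / N) for t in range(N)],
]
cepstra = [real_cepstrum(f) for f in frames]
print(mean_inter_speaker_distance(cepstra))
```

Under these assumptions, a larger mean inter-speaker distance for one class of syllables than for another would indicate that the class separates speakers more strongly in the cepstral domain, which is the kind of comparison the abstract describes for nasal versus oral sounds.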

Bibliographic reference. Amino, Kanae / Sugawara, Tsutomu / Arai, Takayuki (2005): "The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties", in INTERSPEECH-2005, 2025-2028.