ISCA Archive Interspeech 2005

The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties

Kanae Amino, Tsutomu Sugawara, Takayuki Arai

This study investigates how the differences among phones in human speaker identification correspond to their acoustic properties. In the speaker identification test, Japanese CV syllables excerpted from carrier sentences were used as stimuli. As pointed out in previous studies, stimuli containing nasal sounds were significantly more effective for speaker identification than stimuli containing only oral sounds. In the acoustic analyses, we analysed the spectral properties of the stimuli in order to explain these differences in the perception test, and found that the cepstral distances among the speakers were significantly larger for the nasal sounds than for the oral sounds. There were also correspondences between the rankings of the consonants in the identification test and in the cepstral distances.
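
To make the idea of a cepstral distance among speakers concrete, here is a minimal sketch. The abstract does not state how the cepstra were computed or which distance measure was used, so this assumes an FFT-based real cepstrum of a single Hamming-windowed frame and a Euclidean distance averaged over all speaker pairs. The function names, the number of coefficients (12), and the toy input are illustrative assumptions, not details of the paper.

```python
import cmath
import math
from itertools import combinations


def real_cepstrum(frame, n_coeffs=12):
    """Low-order real cepstrum (c1..c_n) of one frame, via a naive DFT.

    Assumption: FFT-style real cepstrum; the paper's exact method is not
    given in the abstract.
    """
    n = len(frame)
    # Apply a Hamming window to reduce spectral leakage.
    windowed = [
        x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
        for i, x in enumerate(frame)
    ]
    # Log magnitude spectrum (small offset avoids log(0)).
    log_mag = []
    for k in range(n):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(windowed))
        log_mag.append(math.log(abs(s) + 1e-10))
    # The log magnitude spectrum of a real signal is symmetric, so its
    # inverse DFT is real and reduces to a cosine sum.
    cepstrum = [
        sum(lm * math.cos(2 * math.pi * k * q / n)
            for k, lm in enumerate(log_mag)) / n
        for q in range(n_coeffs + 1)
    ]
    return cepstrum[1:]  # drop c0, which mostly reflects overall energy


def cepstral_distance(c_a, c_b):
    """Euclidean distance between two cepstral vectors (an assumed measure)."""
    return math.dist(c_a, c_b)


def mean_inter_speaker_distance(cepstra_by_speaker):
    """Average cepstral distance over all pairs of speakers for one syllable."""
    pairs = combinations(sorted(cepstra_by_speaker), 2)
    distances = [
        cepstral_distance(cepstra_by_speaker[a], cepstra_by_speaker[b])
        for a, b in pairs
    ]
    return sum(distances) / len(distances)
```

Under these assumptions, one would compute `mean_inter_speaker_distance` separately for each syllable type and compare the values for nasal and oral consonants; a larger value means the speakers are further apart in cepstral space for that sound.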


doi: 10.21437/Interspeech.2005-634

Cite as: Amino, K., Sugawara, T., Arai, T. (2005) The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties. Proc. Interspeech 2005, 2025-2028, doi: 10.21437/Interspeech.2005-634

@inproceedings{amino05_interspeech,
  author={Kanae Amino and Tsutomu Sugawara and Takayuki Arai},
  title={{The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2025--2028},
  doi={10.21437/Interspeech.2005-634}
}