September 22-25, 1997
In order to investigate the relationship between human perception in speaker identification and acoustic features (fundamental frequency (f0), spectrum, and duration) under various communication conditions, this paper describes several perception experiments and an approach to predict the perceptual contribution rate of each feature. Factors taken into account in this paper are: (1) speaker familiarity and (2) background noise. As a result, it is shown that: (1) the perceptual contribution rate increases as the distance of anacoustic feature increases, (2) the spectral contribution rates for familiar speakers are larger than those for unfamiliar speakers, (3) the contribution of f0 tends to increase as the noise increases, and (4) in case of the same S/N ratio, the contribution of f0 in the computer room noise environment is larger than in the car noise environment.
Bibliographic reference. Kitagawa, Satoshi / Hashimoto, Makoto / Higuchi, Norio (1997): "Effect of speaker familiarity and background noise on acoustic features used in speaker identification", In EUROSPEECH-1997, 2179-2182.