ISCA Archive ICSLP 1994

Comparative study of spectral representations in measuring the English /r/-/l/ acoustic-perceptual dissimilarity

Kiyoaki Aikawa, Reiko A. Yamada

The talker dependency of the correct response rate (CRR) in English /r/-/l/ identification by Japanese listeners has already been reported. This paper shows that this talker dependency can be explained by the acoustical dissimilarity (ADS) between an /r/ and an /l/ measured with the dynamic-cepstrum, a new spectral representation that simulates time-frequency forward masking. Nine spectral representations, including weighted-cepstrum, mel-cepstrum, and delta-cepstrum, were compared in terms of the correlation between the CRR and the ADS. The ADS measured with the dynamic-cepstrum showed the best correlation with the CRR. The experimental results imply that Japanese listeners tend to identify /r/ or /l/ using the succeeding vowels, which are affected by co-articulation.
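
The comparison described in the abstract, ranking spectral representations by how well their ADS correlates with the CRR across talkers, might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which correlation measure was used, so Pearson correlation is assumed here, and the representation names and all numeric values are hypothetical placeholders, not data from the paper.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (dev_x * dev_y)


def rank_representations(crr, ads_by_repr):
    """Sort representations by the correlation of their ADS with the CRR."""
    scores = {name: pearson(crr, ads) for name, ads in ads_by_repr.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical placeholder values (NOT from the paper): one CRR per talker,
# and one ADS per talker for each hypothetical spectral representation.
crr_per_talker = [0.55, 0.62, 0.70, 0.81]
ads_per_talker = {
    "representation_a": [1.0, 1.3, 1.9, 2.4],
    "representation_b": [2.0, 1.1, 2.2, 1.5],
}

ranking = rank_representations(crr_per_talker, ads_per_talker)
for name, r in ranking:
    print(f"{name}: r = {r:.3f}")
```

With these placeholder values, the representation whose ADS rises together with the CRR is ranked first; the paper reports that, on its own data, the dynamic-cepstrum played that role.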


Cite as: Aikawa, K., Yamada, R.A. (1994) Comparative study of spectral representations in measuring the English /r/-/l/ acoustic-perceptual dissimilarity. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 2039-2042

@inproceedings{aikawa94b_icslp,
  author={Kiyoaki Aikawa and Reiko A. Yamada},
  title={{Comparative study of spectral representations in measuring the English /r/-/l/ acoustic-perceptual dissimilarity}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={2039--2042}
}