ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Toward an acoustic-articulatory model of inter-speaker variability

Parham Mokhtari, Frantz Clermont, Kazuyo Tanaka

In this paper we propose a more complete model of inter-speaker variability, which accounts quantitatively for structural differences of the vocal tract (VT) and for learned differences in articulatory setting and in phoneme-specific strategy. This tripartite modelling is applied to a dataset of VT area functions estimated by acoustic-to-articulatory mapping, from formants measured in the steady-states of 5 vowels recorded by 5 adult, male speakers of Japanese. Principal component analyses of each of those physical sources of inter-speaker variation then yield eigenmodes and their variances, which together define an acoustic-articulatory, functional model of inter-speaker variability in vowel production.


Cite as: Mokhtari, P., Clermont, F., Tanaka, K. (2000) Toward an acoustic-articulatory model of inter-speaker variability. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 158-161

@inproceedings{mokhtari00_icslp,
  author={Parham Mokhtari and Frantz Clermont and Kazuyo Tanaka},
  title={{Toward an acoustic-articulatory model of inter-speaker variability}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 158-161}
}