ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Vocal tract area function inversion by linear regression of cepstrum

Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Kiyoshi Honda

Vocal tract data from 3D cine-MRI are used together with synchronised acoustics to evaluate a linear regression model for inversion. The first two principal components of vocalic area functions are predicted with correlations 0.99 and 0.97 respectively, from 24 FFT-cepstra measured in the frequency band 0-4 kHz. This best regression model together with the two component representation yields mean absolute errors of 0.37 cm2 in section area and 0.15 cm in vocal tract length.


doi: 10.21437/Interspeech.2005-845

Cite as: Mokhtari, P., Kitamura, T., Takemoto, H., Honda, K. (2005) Vocal tract area function inversion by linear regression of cepstrum. Proc. Interspeech 2005, 3201-3204, doi: 10.21437/Interspeech.2005-845

@inproceedings{mokhtari05_interspeech,
  author={Parham Mokhtari and Tatsuya Kitamura and Hironori Takemoto and Kiyoshi Honda},
  title={{Vocal tract area function inversion by linear regression of cepstrum}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={3201--3204},
  doi={10.21437/Interspeech.2005-845}
}