ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Recovering vocal tract shapes from MFCC parameters

Sorin Dusan, Li Deng

Recovering vocal tract shapes from the speech signal is a well known inversion problem of transformation from the articulatory system to speech acoustics. Most of the studies on this problem in the past have been focused on vowels. There have not been general methods effective for recovering the vocal tract shapes from the speech signal for all classes of speech sounds. In this paper we describe our attempt towards speech inverse mapping by using the mel-frequency cepstrum coefficients to represent the acoustic parameters of the speech signal. An inversion method is developed based on Kalman filtering and a dynamic-system model describing the articulatory motion. This method uses an articulatory-acoustic codebook derived from Maeda's articulatory model.


doi: 10.21437/ICSLP.1998-795

Cite as: Dusan, S., Deng, L. (1998) Recovering vocal tract shapes from MFCC parameters. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0367, doi: 10.21437/ICSLP.1998-795

@inproceedings{dusan98_icslp,
  author={Sorin Dusan and Li Deng},
  title={{Recovering vocal tract shapes from MFCC parameters}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0367},
  doi={10.21437/ICSLP.1998-795}
}