ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Estimation of vocal tract area function for Mandarin vowel sequences using MRI

Gaowu Wang, Jianwu Dang, Jiangping Kong

To fully explore the dynamic properties of speech production and investigate the relation between vocal tract geometry and speech acoustics, estimation of vocal tract area functions from measurements of the sagittal plane is an important step. In this study, we investigated the relation between the measurements on two dimensional (2D) and three dimensional (3D) MRI data and used an alpha-beta model to describe this relation. As a result, a set of parameters were derived from 3D static MRI data, and applied to time-varying vocal tract widths derived from 2D MRI movies, to synthesize Mandarin vowel sequences. An acoustic evaluation comparing the natural and calculated formants shows that the alpha-beta model can represent dynamic states of articulatory movements of vowel sequences, as well as those of the sustained vowels.


doi: 10.21437/Interspeech.2008-357

Cite as: Wang, G., Dang, J., Kong, J. (2008) Estimation of vocal tract area function for Mandarin vowel sequences using MRI. Proc. Interspeech 2008, 1182-1185, doi: 10.21437/Interspeech.2008-357

@inproceedings{wang08g_interspeech,
  author={Gaowu Wang and Jianwu Dang and Jiangping Kong},
  title={{Estimation of vocal tract area function for Mandarin vowel sequences using MRI}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1182--1185},
  doi={10.21437/Interspeech.2008-357}
}