EUROSPEECH 2003 - INTERSPEECH 2003
Simultaneous measurements of tongue and facial motion, using a combination of electromagnetic articulography (EMA) and optical motion tracking, are analysed to investigate the possibility to resynthesize the subject's tongue movements with a parametrically controlled 3D model using the facial data only. The recorded material consists of 63 VCV words spoken by one Swedish subject. The tongue movements are resynthesized using a combination of a linear estimation to predict the tongue data from the face and an inversion procedure to determine the articulatory parameters of the model.
Bibliographic reference. Engwall, Olov / Beskow, Jonas (2003): "Resynthesis of 3d tongue movements from facial data", In EUROSPEECH-2003, 2261-2264.