EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Resynthesis of 3D Tongue Movements from Facial Data

Olov Engwall, Jonas Beskow

KTH, Sweden

Simultaneous measurements of tongue and facial motion, using a combination of electromagnetic articulography (EMA) and optical motion tracking, are analysed to investigate the possibility to resynthesize the subject's tongue movements with a parametrically controlled 3D model using the facial data only. The recorded material consists of 63 VCV words spoken by one Swedish subject. The tongue movements are resynthesized using a combination of a linear estimation to predict the tongue data from the face and an inversion procedure to determine the articulatory parameters of the model.

Full Paper

Bibliographic reference.  Engwall, Olov / Beskow, Jonas (2003): "Resynthesis of 3d tongue movements from facial data", In EUROSPEECH-2003, 2261-2264.