ISCA Archive AVSP 2001
ISCA Archive AVSP 2001

Viseme space for realistic speech animation

Sumedha Kshirsagar, Nadia Magnenat-Thalmann

For realistic speech animation, smooth viseme and expression transitions, blending and co-articulation so far have been studied and experimented in depth. In this paper, we describe an approach for speech animation by smooth viseme transition. Though this method cannot form an alternative to the co-articulation phenomenon, it certainly takes us a step nearer to realistic speech animation. The approach is devised as a result of the Principal Component Analysis of facial capture extracted data using an optical tracking system. The system extracts the 3D positions of markers attached at the specific feature point locations on face to capture the facial movements of a talking person. We form a vector space representation by using the Principal Component Analysis of this data. We call this space the "viseme space". We use the viseme space to generate convincing speech animation and to make smooth transitions from one viseme to another. As the analysis and the resulting viseme space automatically consider the dynamics of and the deformation constraints on the facial movements, the resulting facial animation is very realistic.

Cite as: Kshirsagar, S., Magnenat-Thalmann, N. (2001) Viseme space for realistic speech animation. Proc. Auditory-Visual Speech Processing, 30-35

  author={Sumedha Kshirsagar and Nadia Magnenat-Thalmann},
  title={{Viseme space for realistic speech animation}},
  booktitle={Proc. Auditory-Visual Speech Processing},