ISCA Archive AVSP 2003
ISCA Archive AVSP 2003

Evaluation of a talking head based on appearance models

Barry-John Theobald, J. Andrew Bangham, Iain Matthews, Gavin Cawley

In this paper we describe how 2D appearance models can be applied to the problem of creating a near-videorealistic talking head. A speech corpus of a talker uttering a set of phonetically balanced training sentences is analysed using a generative model of the human face. Segments of original parameter trajectories corresponding to the synthesis unit are extracted from a codebook, normalised, blended, concatenated and smoothed before being applied to the model to give natural, realistic animations of novel utterances. We also present some early results of subjective tests conducted to determine the realism of the synthesiser.


Cite as: Theobald, B.-J., Bangham, J.A., Matthews, I., Cawley, G. (2003) Evaluation of a talking head based on appearance models. Proc. Auditory-Visual Speech Processing, 187-192

@inproceedings{theobald03_avsp,
  author={Barry-John Theobald and J. Andrew Bangham and Iain Matthews and Gavin Cawley},
  title={{Evaluation of a talking head based on appearance models}},
  year=2003,
  booktitle={Proc. Auditory-Visual Speech Processing},
  pages={187--192}
}