ISCA Archive SSW 2004

Audiovisual text-to-cued speech synthesis

Guillaume Gibert, Gérard Bailly, Frédéric Elisei

We present our efforts to implement a system able to synthesize French Manual Cued Speech (FMCS). We recorded and analyzed the 3D trajectories of 50 hand and 63 facial flesh points during the production of 238 utterances carefully designed to cover all possible diphones of the French language. Linear and nonlinear statistical models of the hand and face deformations and postures have been developed using both separate and joint corpora. We created two separate dictionaries, one containing diphones and the other containing "dikeys". Using these two dictionaries, we implemented a complete text-to-cued speech synthesis system based on the concatenation of diphones and dikeys.


Cite as: Gibert, G., Bailly, G., Elisei, F. (2004) Audiovisual text-to-cued speech synthesis. Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5), 85-90

@inproceedings{gibert04_ssw,
  author={Guillaume Gibert and Gérard Bailly and Frédéric Elisei},
  title={{Audiovisual text-to-cued speech synthesis}},
  year=2004,
  booktitle={Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5)},
  pages={85--90}
}