ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Direct, modular and hybrid audio to visual speech conversion methods - a comparative study

Gyorgy Takacs

A systematic comparative study of audio to visual speech conversion methods is described in this paper. A direct conversion system is compared to conceptually different ASR based solutions. Hybrid versions of the different solutions will also be presented. The methods are tested using the same speech material, audio preprocessing and facial motion visualization units. Only the conversion blocks are changed. Subjective opinion score evaluation tests prove the naturalness of the direct conversion is the best.


doi: 10.21437/Interspeech.2009-643

Cite as: Takacs, G. (2009) Direct, modular and hybrid audio to visual speech conversion methods - a comparative study. Proc. Interspeech 2009, 2267-2270, doi: 10.21437/Interspeech.2009-643

@inproceedings{takacs09_interspeech,
  author={Gyorgy Takacs},
  title={{Direct, modular and hybrid audio to visual speech conversion methods - a comparative study}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2267--2270},
  doi={10.21437/Interspeech.2009-643}
}