A systematic comparative study of audio to visual speech conversion methods is described in this paper. A direct conversion system is compared to conceptually different ASR based solutions. Hybrid versions of the different solutions will also be presented. The methods are tested using the same speech material, audio preprocessing and facial motion visualization units. Only the conversion blocks are changed. Subjective opinion score evaluation tests prove the naturalness of the direct conversion is the best.
Bibliographic reference. Takacs, Gyorgy (2009): "Direct, modular and hybrid audio to visual speech conversion methods - a comparative study", In INTERSPEECH-2009, 2267-2270.