![]() |
Auditory-Visual Speech Processing (AVSP'98)December 4-6, 1998 |
![]() |
An audio-visual intelligibility score is generally used as an evaluation measure in visual speech synthesis. Especially an intelligibility score of talking heads represents accuracy of facial models[1][2]. The facial models has two stages such as construction of real faces and realization of dynamical human-like motions. We focus on lip movement synthesis from input acoustic speech to realize dynamical motions. The goal of our researchis to synthesize lip movements natural enough to do lip-reading. In previous research, we have proposed a lip movement synthesis method using HMMs which can incorporate a forward coarticulation effect and confirmed its effectiveness through objective evaluation tests. In this paper, subjective evaluation tests are performed. Intelligibility test and acceptability test are conducted for subjective evaluation.
Bibliographic reference. Yamamoto, Eli / Nakamura, Satoshi / Shikano, Kiyohiro (1998): "Subjective evaluation for HMM-based speech-to-lip movement synthesis", In AVSP-1998, 227-232.