Third International Conference on Spoken Language Processing (ICSLP 94)
Lip movements during utterances of short Japanese sentences were predicted from orofacial muscle activity, measured by electromyography (EMG), using artificial neural network models. The inputs to the network were EMG signals from six orofacial muscles. The outputs were two lip movement parameters: the horizontal distance between the corners of the mouth and the distance between the midsagittal lower lip and jaw markers. In addition to the relation between muscle EMG and articulator motion, the network learned the shape of a time-delay filter. A comparison of position, velocity, and acceleration prediction networks showed that the position prediction network performed best at recovering lip movement.
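As an illustration only, the mapping described above (six EMG channels in, two lip distances out, with a dependence on past samples) might be sketched as follows. The abstract does not give the network architecture, so the number of delay taps, the hidden-layer size, the tanh nonlinearity, and the names `delay_windows`, `init_params`, and `predict` are all assumptions; the EMG data is synthetic and the weights are untrained.

```python
import numpy as np

N_MUSCLES = 6   # from the abstract: six orofacial muscles
N_OUTPUTS = 2   # from the abstract: two lip-shape distances
N_TAPS = 10     # assumption: EMG samples per muscle in each input window
N_HIDDEN = 8    # assumption: hidden-layer size


def delay_windows(emg, n_taps):
    """Stack n_taps consecutive samples of every channel into one row.

    emg: array of shape (T, n_channels)
    returns: array of shape (T - n_taps + 1, n_taps * n_channels)
    """
    n_samples, _ = emg.shape
    n_windows = n_samples - n_taps + 1
    rows = [emg[t:t + n_taps].ravel() for t in range(n_windows)]
    return np.asarray(rows)


def init_params(rng, n_in, n_hidden, n_out):
    """Small random weights for a one-hidden-layer network (untrained)."""
    return {
        "W1": rng.normal(0.0, 0.1, size=(n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, size=(n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }


def predict(params, windows):
    """Forward pass: tanh hidden layer, linear output layer."""
    hidden = np.tanh(windows @ params["W1"] + params["b1"])
    return hidden @ params["W2"] + params["b2"]


rng = np.random.default_rng(0)
emg = rng.random((200, N_MUSCLES))        # synthetic stand-in for rectified EMG
windows = delay_windows(emg, N_TAPS)      # one row per predicted time step
params = init_params(rng, N_TAPS * N_MUSCLES, N_HIDDEN, N_OUTPUTS)
lip = predict(params, windows)            # two lip distances per time step
print(windows.shape, lip.shape)
```

In a sketch of this kind, the first-layer weights over the delayed samples of each muscle play the role of a learned time-delay filter. Training the same structure on position, velocity, or acceleration targets would correspond to the three network variants compared in the abstract.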
Bibliographic reference. Hirayama, Makoto / Vatikiotis-Bateson, Eric / Gracco, Vincent / Kawato, Mitsuo (1994): "Neural network prediction of lip shape from muscle EMG in Japanese speech", In ICSLP-1994, 587-590.