This paper deals with articulatory copy synthesis from X-ray films. The underlying articulatory synthesizer uses an aerodynamic and an acoustic simulation using target area functions, F0 and transition patterns from one area function to the next as input data. The articulators, tongue in particular, have been delineated by hand or semi-automatically from the X-ray films. A specific attention has been paid on the determination of the centerline of the vocal tract from the image and on the coordination between glottal area and vocal tract constrictions since both aspects strongly impact on the acoustics. Experiments show that good quality speech can be resynthesized even if the interval between two images is 40 ms. The same approach could be easily applied to cine MRI data.
Bibliographic reference. Laprie, Yves / Loosvelt, Matthieu / Maeda, Shinji / Sock, Rudolph / Hirsch, Fabrice (2013): "Articulatory copy synthesis from cine x-ray films", In INTERSPEECH-2013, 2024-2028.