14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Articulatory Copy Synthesis from Cine X-Ray Films

Yves Laprie (1), Matthieu Loosvelt (2), Shinji Maeda (2), Rudolph Sock (3), Fabrice Hirsch (4)

(1) LORIA, France
(2) LPP (UMR 7018), France
(3) IPS, France
(4) Praxiling, France

This paper deals with articulatory copy synthesis from X-ray films. The underlying articulatory synthesizer uses an aerodynamic and an acoustic simulation using target area functions, F0 and transition patterns from one area function to the next as input data. The articulators, tongue in particular, have been delineated by hand or semi-automatically from the X-ray films. A specific attention has been paid on the determination of the centerline of the vocal tract from the image and on the coordination between glottal area and vocal tract constrictions since both aspects strongly impact on the acoustics. Experiments show that good quality speech can be resynthesized even if the interval between two images is 40 ms. The same approach could be easily applied to cine MRI data.

Full Paper

Bibliographic reference.  Laprie, Yves / Loosvelt, Matthieu / Maeda, Shinji / Sock, Rudolph / Hirsch, Fabrice (2013): "Articulatory copy synthesis from cine x-ray films", In INTERSPEECH-2013, 2024-2028.