16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

A Comprehensive 3D Biomechanically-Driven Vocal Tract Model Including Inverse Dynamics for Speech Research

Peter Anderson (1), Negar M. Harandi (1), Scott Moisik (2), Ian Stavness (3), Sidney Fels (1)

(1) University of British Columbia, Canada
(2) MPI for Psycholinguistics, The Netherlands
(3) University of Saskatchewan, Canada

We introduce a biomechanical model of oropharyngeal structures that adds the soft-palate, pharynx, and larynx to our previous models of jaw, skull, hyoid, tongue, and face in a unified model. The model includes a comprehensive description of the upper airway musculature, using point-to-point muscles that may either be embedded within the deformable structures or operate externally. The airway is described by an air-tight mesh that fits and deforms with the surrounding articulators, which enables dynamic coupling to our articulatory speech synthesizer. We demonstrate that the biomechanics, in conjunction with the skinning, supports a range from physically realistic to simplified vocal tract geometries to investigate different approaches to aeroacoustic modeling of vocal tract. Furthermore, our model supports inverse modeling to support investigation of plausible muscle activation patterns to generate speech.

Full Paper

Bibliographic reference.  Anderson, Peter / Harandi, Negar M. / Moisik, Scott / Stavness, Ian / Fels, Sidney (2015): "A comprehensive 3d biomechanically-driven vocal tract model including inverse dynamics for speech research", In INTERSPEECH-2015, 2395-2399.