9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Two- and Three-Dimensional Visual Articulatory Models for Pronunciation Training and for Treatment of Speech Disorders

Bernd J. Kröger (1), Verena Graf-Borttscheller (1), Anja Lowit (2)

(1) RWTH Aachen University, Germany; (2) University of Strathclyde, UK

Visual articulatory models can be used for visualizing vocal tract articulatory speech movements. This information may be helpful in pronunciation training or in therapy of speech disorders. For testing this hypothesis, speech recognition rates were quantified for mute animations of vocalic and consonantal speech movements generated by a 2D and a 3D visual articulatory model. The visually based speech sound recognition test (mimicry test) was performed by two groups of eight children (five to eight years old) matched in age and sex. The children were asked to mimic the visually produced mute speech movement animations for different speech sounds. Recognition rates stay significantly above chance but indicate no significant difference for each of the two models. Children older than 5 years are capable of interpreting vocal tract articulatory speech sound movements without any preparatory training in a speech adequate way. The complex 3D-display of vocal tract articulatory movements provides no significant advantage in comparison to the visually simpler 2D-midsagittal displays of vocal tract articulatory movements.

Full Paper

Bibliographic reference.  Kröger, Bernd J. / Graf-Borttscheller, Verena / Lowit, Anja (2008): "Two- and three-dimensional visual articulatory models for pronunciation training and for treatment of speech disorders", In INTERSPEECH-2008, 2639-2642.