ISCA Archive SSW 2007

Perspectives for articulatory speech synthesis

Bernd J. Kröger

Articulatory speech synthesis can currently be viewed from two perspectives. (i) Technical perspective: Owing to progress in commodity computer hardware (a general increase in computation speed) and software (more usable compilers and simulation software), it is now possible to develop comprehensive phonetic models of speech production that calculate acoustic speech signals in nearly real time. Furthermore, phonetic knowledge has increased to the point that these production models are now capable of achieving good to high acoustic quality. The main limitations lie in the control modules. In this paper we argue for a self-learning, input-dependent gestural control model for articulatory speech synthesis. (ii) Theoretical perspective: A comprehensive articulatory speech synthesis system capable of producing high-quality acoustic output necessarily incorporates extensive knowledge of all phonetic aspects of speech production: articulatory sound targets, typical articulatory movement strategies for realizing sounds or syllables (e.g. coarticulation), a general concept for the temporal coordination of speech-relevant articulatory movements (i.e. speech gestures), etc. In this paper an example of such a system is given, and a control concept for high-quality articulatory speech synthesis, a question that is still open, is proposed.

Cite as: Kröger, B.J. (2007) Perspectives for articulatory speech synthesis. Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6), 391 (abstract)

  author={Bernd J. Kröger},
  title={{Perspectives for articulatory speech synthesis}},
  booktitle={Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6)},
  pages={391 (abstract)}