Third European Conference on Speech Communication and Technology

Berlin, Germany
September 22-25, 1993


Physiologically-Motivated Modeling of the Voice Source in Articulatory Analysis/Synthesis

Juergen Schroeter (1), Bert Cranen (2)

(1) Acoustics Research Dept., 2D-545, AT&T Bell Laboratories, Murray Hill, NJ, USA
(2) Dept. of Language and Speech, Nijmegen University, Nijmegen, The Netherlands

This paper describes the implementation of a new parametric model of the glottal geometry aimed at improving female (and male) speech synthesis in the framework of articulatory analysis!synthesis of speech. The model is controlled by parameters that are tightly coupled to physiology, such as, for example, vocal-fold abduction. It is imbedded in an articulatory analysis/synthesis system (articulatory speech mimic). To introduce naturally-occurring details in our glottal flow waveforms, we included two different kinds of glottal leakage in our model: a "linked leak" and a "parallel glottal chink". While the first is basically an incomplete glottal closure that results in, among other things, a steeper roll-off of the glottal flow spectrum, the latter models a second glottal duct that is independent of the membranous (vibrating) part of the glottis. Our simulations show that, as far as deterministic excitation (not the glottal noise) is concerned, a parallel glottal chink can actually enhance spectral levels at high frequencies relative to the no-leakage case.

Full Paper

Bibliographic reference.  Schroeter, Juergen / Cranen, Bert (1993): "Physiologically-motivated modeling of the voice source in articulatory analysis/synthesis", In EUROSPEECH'93, 95-98.