A lumped physical model of the glottal source is presented. Vocal folds are described as single masses but, unlike conventional one-mass models, vertical phase differences between upper and lower margins of the folds are taken into account. This is done by appropriately describing the non-linear interaction of the mechanical model with aerodynamics, resulting in a modified one-mass model, or a 'one-delayed-mass model'. Analysis on numerical simulations shows that the system behaves qualitatively as higher-dimensional ones (such as the two-mass model by Ishizaka and Flanagan); in particular, control over flow skewness is guaranteed, allowing for synthesis of realistic glottal flow waveforms. As only one degree of freedom (one mass) is needed in the model, structure and number of parameters are drastically reduced, thus making it suitable for real-time synthesis applications.
Cite as: Avanzini, F., Alku, P., Karjalainen, M. (2001) One-delayed-mass model for efficient synthesis of glottal flow. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 51-54, doi: 10.21437/Eurospeech.2001-12
@inproceedings{avanzini01_eurospeech, author={Federico Avanzini and Paavo Alku and Matti Karjalainen}, title={{One-delayed-mass model for efficient synthesis of glottal flow}}, year=2001, booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)}, pages={51--54}, doi={10.21437/Eurospeech.2001-12} }