ISCA Archive VOQUAL 2003
ISCA Archive VOQUAL 2003

Emotions and voice quality: experiments with sinusoidal modeling

Carlo Drioli, Graziano Tisato, Piero Cosi, Fabio Tesser

Voice quality is recognized to play an important role for the rendering of emotions in verbal communication. In this paper we explore the effectiveness of a sinusoidal modeling processing framework for voice transformations finalized to the analysis and synthesis of emotive speech. A set of acoustic cues is selected to compare the voice quality characteristics of the speech signals on a voice corpus in which different emotions are reproduced. The sinusoidal signal processing tool is used to convert a neutral utterance into emotive utterances. Two different procedures are applied and compared: in the first one, only the alignment of phoneme duration and of pitch contour is performed; the second procedure refines the transformations by using a spectral conversion function. This refinement improves the reproduction of the different voice qualities of the target emotive utterances. The acoustic cues extracted from the transformed utterances are compared to the emotive original utterances, and the properties and quality of the transformation method are discussed.


Cite as: Drioli, C., Tisato, G., Cosi, P., Tesser, F. (2003) Emotions and voice quality: experiments with sinusoidal modeling. Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003), 127-132

@inproceedings{drioli03_voqual,
  author={Carlo Drioli and Graziano Tisato and Piero Cosi and Fabio Tesser},
  title={{Emotions and voice quality: experiments with sinusoidal modeling}},
  year=2003,
  booktitle={Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003)},
  pages={127--132}
}