ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Perceived naturalness of a synthesizer of disordered voices

Samia Fraj, Francis Grenez, Jean Schoentgen

The presentation describes a synthesizer of normal and disordered voice timbres and their perceptual evaluation with respect to naturalness. The simulator uses a shaping function model, which enables controlling the perturbations of the frequency and harmonic richness of the glottal area signal via the control of the instantaneous frequency and amplitude of two harmonic driving functions. Several types of perturbations are simulated. Perceptual experiments, which involve stimuli of synthetic and human vowels with normal values of perturbations, have been carried out. The first has been based on a binary synthetic/natural classification. The second has involved a discrimination task. Both experiments suggest that human judges are unable to distinguish between human and synthetic vowels prepared with the synthesizer described here.

doi: 10.21437/Interspeech.2009-736

Cite as: Fraj, S., Grenez, F., Schoentgen, J. (2009) Perceived naturalness of a synthesizer of disordered voices. Proc. Interspeech 2009, 2907-2910, doi: 10.21437/Interspeech.2009-736

  author={Samia Fraj and Francis Grenez and Jean Schoentgen},
  title={{Perceived naturalness of a synthesizer of disordered voices}},
  booktitle={Proc. Interspeech 2009},