ISCA Archive VOQUAL 2003
ISCA Archive VOQUAL 2003

Defining and measuring voice quality

Jody Kreiman, Diana Vanlancker-Sidtis, Bruce R. Gerratt

Although voices provide listeners with significant information about speakers, defining and measuring voice quality remain elusive goals. We argue that the much-maligned ANSI standard definition of sound quality is in fact an appropriate definition, because it treats quality as the result of a perceptual process rather than a fixed quantity, and highlights the interaction between listeners and signals in determining quality in the context of specific perceptual goals. Which aspects of the signal are important will depend on the task, the characteristics of the stimuli, the listener’s background, perceptual habits, and so on. Given the many kinds of information listeners extract from voice signals, it is not surprising that these characteristics vary from task to task, voice to voice, and listener to listener. Application of speech synthesis in method-of-adjustment tasks allows measurement of quality psychoacoustically as those aspects of the signal that allow a listener to determine that two sounds of equal pitch and loudness are different, and holds promise for improving the reliability and validity of measures of voice quality.


Cite as: Kreiman, J., Vanlancker-Sidtis, D., Gerratt, B.R. (2003) Defining and measuring voice quality. Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003), 115-120

@inproceedings{kreiman03b_voqual,
  author={Jody Kreiman and Diana Vanlancker-Sidtis and Bruce R. Gerratt},
  title={{Defining and measuring voice quality}},
  year=2003,
  booktitle={Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003)},
  pages={115--120}
}