Large-Scale Speaker Ranking from Crowdsourced Pairwise Listener Ratings

Timo Baumann


Speech quality and likability is a multi-faceted phenomenon consisting of a combination of perceptory features that cannot easily be computed nor weighed automatically. Yet, it is often easy to decide which of two voices one likes better, even though it would be hard to describe why, or to name the underlying basic perceptory features. Although likability is inherently subjective and individual preferences differ frequently, generalizations are useful and there is often a broad intersubjective consensus about whether one speaker is more likable than another. However, breaking down likability rankings into pairwise comparisons leads to a quadratic explosion of rating pairs. We present a methodology and software to efficiently create a likability ranking for many speakers from crowdsourced pairwise likability ratings. We collected pairwise likability ratings for many (>220) speakers from many raters (>160) and turn these ratings into one likability ranking. We investigate the resulting speaker ranking stability under different conditions: limiting the number of ratings and the dependence on rater and speaker characteristics. We also analyze the ranking wrt. acoustic correlates to find out what factors influence likability. We publish our ranking and the underlying ratings in order to facilitate further research.


 DOI: 10.21437/Interspeech.2017-1697

Cite as: Baumann, T. (2017) Large-Scale Speaker Ranking from Crowdsourced Pairwise Listener Ratings. Proc. Interspeech 2017, 2262-2266, DOI: 10.21437/Interspeech.2017-1697.


@inproceedings{Baumann2017,
  author={Timo Baumann},
  title={Large-Scale Speaker Ranking from Crowdsourced Pairwise Listener Ratings},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={2262--2266},
  doi={10.21437/Interspeech.2017-1697},
  url={http://dx.doi.org/10.21437/Interspeech.2017-1697}
}