ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Demographic recommendation by means of group profile elicitation using speaker age and gender recognition

Sven Ewan Shepstone, Zheng-Hua Tan, Søren Holdt Jensen

In this paper we show a new method of using automatic age and gender recognition to recommend a sequence of multimedia items to a home TV audience comprising multiple viewers. Instead of relying on explicitly provided demographic data for each user, we define an audio-based demographic group profile that captures the age and gender for all members of the audience. A 7-class age and gender classifier employing a fusion of acoustic and prosodic features determines the probability of each speaker belonging to each class. The information for all speakers is then combined to form the group profile, which itself is the input to a recommender system. The recommender system finds the content items whose demographics best match the group profile. We tested the effectiveness of the system for several typical home audience configurations. In a survey, users were given a configuration and asked to rate a set of advertisements on how well each advertisement matched the configuration. Unbeknown to the subjects, half of the adverts were recommended using the derived audio demographics and the other half were randomly chosen. The recommended adverts received a significantly higher median rating of 7.75, as opposed to 4.25 for the randomly selected adverts.


doi: 10.21437/Interspeech.2013-244

Cite as: Shepstone, S.E., Tan, Z.-H., Jensen, S.H. (2013) Demographic recommendation by means of group profile elicitation using speaker age and gender recognition. Proc. Interspeech 2013, 2827-2831, doi: 10.21437/Interspeech.2013-244

@inproceedings{shepstone13_interspeech,
  author={Sven Ewan Shepstone and Zheng-Hua Tan and Søren Holdt Jensen},
  title={{Demographic recommendation by means of group profile elicitation using speaker age and gender recognition}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2827--2831},
  doi={10.21437/Interspeech.2013-244}
}