11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Age and Gender Classification Using Fusion of Acoustic and Prosodic Features

Hugo Meinedo, Isabel Trancoso

INESC-ID Lisboa, Portugal

This paper describes the Age and Gender classification systems that the INESC-ID Spoken Language Systems Laboratory (L2F) submitted to the INTERSPEECH 2010 Paralinguistic Challenge. The L2F Age classification system fuses four individual sub-systems, and the Gender classification system fuses six. The sub-systems are trained with short- and long-term acoustic and prosodic features, use different classification strategies (GMM-UBM, MLP and SVM), and are trained on four different speech corpora. The best results, obtained with a calibration and linear logistic regression fusion back-end, show an absolute improvement in unweighted accuracy of 4.1% for Age and 5.8% for Gender over the competition baseline systems on the development set.
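
The abstract does not give the fusion formula, so the following is only a minimal sketch of what a linear logistic regression fusion back-end and the unweighted accuracy metric might look like. It assumes one scalar weight per sub-system plus a per-class offset, trained by plain gradient descent on the multiclass cross-entropy; the function names, array layout, learning rate and epoch count are all hypothetical and are not taken from the paper.

```python
import numpy as np


def unweighted_accuracy(y_true, y_pred, n_classes):
    """Mean of the per-class recalls; classes absent from y_true are skipped."""
    recalls = []
    for c in range(n_classes):
        mask = y_true == c
        if mask.any():
            recalls.append(np.mean(y_pred[mask] == c))
    return float(np.mean(recalls))


def softmax(z):
    """Row-wise softmax with the usual max-subtraction for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def train_fusion(scores, labels, n_classes, lr=0.05, epochs=200):
    """Fit fusion weights (assumed form: one weight per sub-system, one offset per class).

    scores: array of shape (n_subsystems, n_samples, n_classes)
    labels: integer array of shape (n_samples,)
    """
    n_sys = scores.shape[0]
    w = np.ones(n_sys) / n_sys          # start from a plain average of the sub-systems
    b = np.zeros(n_classes)
    onehot = np.eye(n_classes)[labels]
    for _ in range(epochs):
        fused = np.tensordot(w, scores, axes=1) + b      # (n_samples, n_classes)
        grad = (softmax(fused) - onehot) / len(labels)   # d(cross-entropy)/d(fused)
        w -= lr * np.einsum("snc,nc->s", scores, grad)
        b -= lr * grad.sum(axis=0)
    return w, b


def fuse(scores, w, b):
    """Return the predicted class index for each sample from the fused scores."""
    return np.argmax(np.tensordot(w, scores, axes=1) + b, axis=1)
```

In this sketch the weights would be fitted on held-out scores and then applied with `fuse` to new data. Unweighted accuracy averages the recall of each class, so a classifier that always predicts the majority class scores poorly on it even when its plain accuracy is high.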

Index Terms: Paralinguistic Challenge, Age, Gender, Fusion of Acoustic and Prosodic Features

Bibliographic reference.  Meinedo, Hugo / Trancoso, Isabel (2010): "Age and gender classification using fusion of acoustic and prosodic features", In INTERSPEECH-2010, 2818-2821.