This work surveys the potential for predicting demographic traits of individual speakers (gender, age, education level, ethnicity, and geographic region) using only word usage features derived from the output of a speech recognition system on conversational American English. Significant differences in word usage patterns among the classes allow for reasonably high classification accuracy (60%-82%), even without extensive training data.
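As a rough illustration of classifying speakers from word usage alone, the sketch below trains a multinomial Naive Bayes model with add-one smoothing on word counts from transcripts. The abstract does not say which classifier or features the paper uses, so the choice of Naive Bayes, the function names `train` and `predict`, the placeholder labels "A" and "B", and the toy transcripts are all assumptions made for this example, not the author's method or data.

```python
import math
from collections import Counter, defaultdict


def train(samples):
    """Count words per class from (label, transcript) pairs.

    Hypothetical helper: stands in for whatever model the paper trains.
    """
    word_counts = defaultdict(Counter)  # label -> word -> count
    label_counts = Counter()            # label -> number of transcripts
    vocab = set()
    for label, text in samples:
        words = text.lower().split()
        word_counts[label].update(words)
        label_counts[label] += 1
        vocab.update(words)
    return word_counts, label_counts, vocab


def predict(model, text):
    """Return the label with the highest smoothed log-probability."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        # Log prior from class frequency in the training set.
        score = math.log(label_counts[label] / total)
        # Add-one (Laplace) smoothing over the shared vocabulary.
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            if word in vocab:  # words unseen in training are skipped
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy transcripts; "A" and "B" are placeholder classes only.
TOY_SAMPLES = [
    ("A", "yeah totally um like you know"),
    ("A", "like um yeah i guess"),
    ("B", "indeed i suppose that is correct"),
    ("B", "that is quite correct indeed"),
]

model = train(TOY_SAMPLES)
print(predict(model, "um like yeah"))
print(predict(model, "that is correct"))
```

In a real setting the transcripts would come from recognizer output, which contains recognition errors, and accuracy would be measured on held-out speakers rather than on hand-written examples like these.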
Bibliographic reference. Gillick, Dan (2010): "Can conversational word usage be used to predict speaker demographics?", In INTERSPEECH-2010, 1381-1384.