An analysis of the user occupational class through Twitter content

Citation:
Preoţiuc-Pietro, Daniel, Vasileios Lampos, and Nikolaos Aletras. An analysis of the user occupational class through Twitter content In ACL., 2015.

Abstract:

Social media content can be used as a complementary source to the traditional methods for extracting and studying collective social attributes. This study focuses on the prediction of the occupational class for a public user profile. Our analysis is conducted on a new annotated corpus of Twitter users, their respective job titles, posted textual content and platform-related attributes. We frame our task as classification using latent feature representations such as word clusters and embeddings. The employed linear and, especially, non-linear methods can predict a user’s occupational class with strong accuracy for the coarsest level of a standard occupation taxonomy which includes nine classes. Combined with a qualitative assessment, the derived results confirm the feasibility of our approach in inferring a new user attribute that can be embedded in a multitude of downstream applications.

[Slides] [Video] [Dataset] [Readme

[SOC Hierarchy] [Word clusters][GP tutorial]

PreviewAttachmentSize
PDF386.61 KB
Slides804.93 KB