Title :
A multi-label model to predict undisclosed attributes in microblogging
Author :
Xingfeng Pan; Jin Yang; Xiaofeng Qiu
Author_Institution :
Beijing Key Laboratory of Network System Architecture and Convergence, Beijing University of Posts and Telecommunications, China
Abstract :
In this paper, the automatic prediction of micro-blog users´ attribute information is modeled as a multi-label classification process considering the relevance and co-occurrence of user attributes instead of single-label or regression, where each user has more than one attribute. Two basic and one improved multi-label classification methods are evaluated and compared in this task. Taking micro-blogging as the background, we conduct a series of experiments, comparison and analysis to prove that the multi-label classification is a very effective method to predict the user profiles considering the relevance and co-occurrence of user attributes. In feature selection stage, we consider a variety of attribute information of users such as user nickname, user tags and user personal description. The results show that a relatively great performance is obtained. Finally, we analyze the user profiling and the user preferred vocabulary in the associated attributes.
Keywords :
"Convergence","Telecommunications","Loss measurement","Data models","Entertainment industry","Data acquisition","Data preprocessing"
Conference_Titel :
Behavioral, Economic and Socio-cultural Computing (BESC), 2015 International Conference on
DOI :
10.1109/BESC.2015.7365949