DocumentCode
660651
Title
CMPK: A High Accuracy Microblog User Classification Method for Professional Analysis
Author
Ying Peng ; Haiquan Wang
Author_Institution
Sch. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
fYear
2013
fDate
4-6 Nov. 2013
Firstpage
134
Lastpage
139
Abstract
Analyzing and mining the massive data recorded in microblog in order to discover the characteristics and rules of individual behaviors, group behaviors and interactive behaviors is now the research hotspot of massive data mining and behavioral analysis area. However, the influence of social attributes, such as user´s occupation, to his behavior and social relations is always neglected in the existing researches. Concerning this issue, the paper proposed a high accuracy microblog user classification method for professional analysis - CMPK (Classification Method based on Professional lexicon and K-nearest neighbor algorithm), this method uses vector space model combined with the professional lexicon and KNN (K-Nearest Neighbor algorithm) classification algorithm to analyze the industry that the microblog user belongs to based on all kinds of information he put on the network. The experiment proved that the accuracy rate of CMPK is nearly 90% which is high precision.
Keywords
Web sites; behavioural sciences computing; data analysis; data mining; learning (artificial intelligence); pattern classification; CMPK method; behavioral analysis; classification method based on professional lexicon and k-nearest neighbor algorithm; group behaviors; individual behaviors; interactive behaviors; massive data analysis; massive data mining; microblog; microblog user classification method; professional analysis; Classification algorithms; Computer architecture; Computers; Feature extraction; Industries; Support vector machine classification; Training; K-Nearest Neighbor algorithm; text mining; user classification; vector space model;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud and Service Computing (CSC), 2013 International Conference on
Conference_Location
Beijing
Type
conf
DOI
10.1109/CSC.2013.28
Filename
6693190
Link To Document