Title :
Paper Classification by Topic Grouping in Citation Networks
Author :
Su, Yi-Jen ; Wun, Jian-Cheng ; Hsu, Wei-Lin ; Chen, Yue-Qun
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Shu-Te Univ., Kaohsiung, Taiwan
Abstract :
The enormous popularity of Web 2.0 social network services has led to much research on social network analysis (SNA). These studies focus on analyzing the complex interactive activities between users in the world of virtual networks. SNA has shown great potential in automatic document classification, especially in identifying citation networks of research papers and the references among them. This research adopts the Clique Percolation Method (CPM) to identify all overlapping subgroups in a citation network. In the grouping process, research papers with similar topics will be grouped into the same topic group. Two papers are regarded as having a relationship when the common citation rate between them is higher than the threshold. A modified TF-IDF calculates the weight of each keyword in the topic groups. The keyword-weight vector represents the main features of each group, while the category of a new-coming document is determined by a novel similarity function. All the papers under study are collected from the journal IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) published from 1979 to 2011.
Keywords :
Internet; citation analysis; document handling; pattern classification; social networking (online); CPM; Clique Percolation Method; SNA; TF-IDF; TPAMI; Web 2.0 social network services; automatic document classification; citation networks; complex interactive activities; keyword-weight vector; novel similarity function; paper classification; similar topics; social network analysis; topic grouping; transactions on pattern analysis and machine intelligence; Algorithm design and analysis; Classification algorithms; Communities; Complex networks; Data mining; Partitioning algorithms; Social network services; CPM; Citation Network; Social Network Analysis; TF-IDF;
Conference_Titel :
Computing, Measurement, Control and Sensor Network (CMCSN), 2012 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4673-2033-7
DOI :
10.1109/CMCSN.2012.53