DocumentCode :
2312720
Title :
Using Abstract Information and Community Alignment Information for Link Prediction
Author :
Sachan, Mrinmaya ; Ichise, Ryutaro
Author_Institution :
Comput. Sci. & Eng., Indian Inst. of Technol., Kanpur, India
fYear :
2010
fDate :
9-11 Feb. 2010
Firstpage :
61
Lastpage :
65
Abstract :
Although there have been many recent studies of link prediction in co-authorship networks, few have tried to utilize the Semantic information hidden in abstracts of the research documents. We propose to build a link predictor in a co-authorship network where nodes represent researchers and links represent co-authorship. In this method, we use the structure of the constructed graph, and propose to add a semantic approach using abstract information, research titles and the event information to improve the accuracy of the predictor. Secondly, we make use of the fact that researchers tend to work in close knit communities. The knowledge of a pair of researchers lying in the same dense community can be used to improve the accuracy of our predictor further. Finally, we test out hypothesis on the DBLP database in a reasonable time by under-sampling and balancing the data set using decision trees and the SMOTE technique.
Keywords :
citation analysis; data mining; decision trees; text analysis; SMOTE technique; abstract information; coauthorship network; community alignment information; decision trees; event information; graph structure; link prediction; research document abstracts; research titles; semantic information; Abstracts; Accuracy; Collaboration; Computer networks; Computer science; Data mining; Databases; Informatics; Machine learning; Testing; Data Mining; Graph Mining; Link Prediction; Machine Learning; Social Networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Computing (ICMLC), 2010 Second International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4244-6006-9
Electronic_ISBN :
978-1-4244-6007-6
Type :
conf
DOI :
10.1109/ICMLC.2010.25
Filename :
5460690
Link To Document :
بازگشت