DocumentCode :
3165751
Title :
Social Network Extraction of Academic Researchers
Author :
Tang, Jie ; Zhang, Duo ; Yao, Limin
Author_Institution :
Tsinghua Univ., Tsinghua
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
292
Lastpage :
301
Abstract :
This paper addresses the issue of extraction of an academic researcher social network. By researcher social network extraction, we are aimed at finding, extracting, and fusing the ´semantic ´-based profiling information of a researcher from the Web. Previously, social network extraction was often undertaken separately in an ad-hoc fashion. This paper first gives a formalization of the entire problem. Specifically, it identifies the ´relevant documents´ from the Web by a classifier. It then proposes a unified approach to perform the researcher profiling using conditional random fields (CRF). It integrates publications from the existing bibliography datasets. In the integration, it proposes a constraints-based probabilistic model to name disambiguation. Experimental results on an online system show that the unified approach to researcher profiling significantly outperforms the baseline methods of using rule learning or classification. Experimental results also indicate that our method to name disambiguation performs better than the baseline method using unsupervised learning. The methods have been applied to expert finding. Experiments show that the accuracy of expert finding can be significantly improved by using the proposed methods.
Keywords :
Internet; learning (artificial intelligence); bibliography datasets; conditional random fields; constraints-based probabilistic model; online system; social network extraction; unsupervised learning; Application software; Biometrics; Computer science; Computer vision; Data mining; Image databases; Indexing; Social network services; USA Councils; Visual databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3018-5
Type :
conf
DOI :
10.1109/ICDM.2007.30
Filename :
4470253
Link To Document :
بازگشت