DocumentCode
3165751
Title
Social Network Extraction of Academic Researchers
Author
Tang, Jie ; Zhang, Duo ; Yao, Limin
Author_Institution
Tsinghua Univ., Tsinghua
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
292
Lastpage
301
Abstract
This paper addresses the issue of extraction of an academic researcher social network. By researcher social network extraction, we are aimed at finding, extracting, and fusing the ´semantic ´-based profiling information of a researcher from the Web. Previously, social network extraction was often undertaken separately in an ad-hoc fashion. This paper first gives a formalization of the entire problem. Specifically, it identifies the ´relevant documents´ from the Web by a classifier. It then proposes a unified approach to perform the researcher profiling using conditional random fields (CRF). It integrates publications from the existing bibliography datasets. In the integration, it proposes a constraints-based probabilistic model to name disambiguation. Experimental results on an online system show that the unified approach to researcher profiling significantly outperforms the baseline methods of using rule learning or classification. Experimental results also indicate that our method to name disambiguation performs better than the baseline method using unsupervised learning. The methods have been applied to expert finding. Experiments show that the accuracy of expert finding can be significantly improved by using the proposed methods.
Keywords
Internet; learning (artificial intelligence); bibliography datasets; conditional random fields; constraints-based probabilistic model; online system; social network extraction; unsupervised learning; Application software; Biometrics; Computer science; Computer vision; Data mining; Image databases; Indexing; Social network services; USA Councils; Visual databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
ISSN
1550-4786
Print_ISBN
978-0-7695-3018-5
Type
conf
DOI
10.1109/ICDM.2007.30
Filename
4470253
Link To Document