مرکز منطقه ای اطلاع رساني علوم و فناوري - Social Network Analysis on Name Disambiguation and More

DocumentCode :

3500055

Title :

Social Network Analysis on Name Disambiguation and More

Author :

On, Byung-Won

Author_Institution :

Dept. of Comput. Sci., Univ. of British Columbia, Vancouver, BC

Volume :

fYear :

2008

fDate :

11-13 Nov. 2008

Firstpage :

1081

Lastpage :

1088

Abstract :

Name variants are ubiquitous in real world due typographical errors (e.g., "Forschungszentrum Julich" vs. "Forschungszentrum Julich"), abbreviated, imcomplete, or missing information (e.g., "R. E. Ellis" vs. "Randy E. Ellis"), lack of standard name formatting convention (e.g., "Spike Jonze" vs. "Jones, Spike"), and their combinations. In this paper, we project this name disambiguation problem to graph representation, and then analyze graphs using social network analysis. In particular, we used real duplicate name entities that we manually verifed from ACM digital library. Then, using various string similarity metrics and additional information (i.e., co-author names, titles, and venues), we analyze the effectiveness of string similarity metrics and additional information based on social network analysis. Through our experimental validation, name disambiguation problem can be analyzed in graphical, visual manner.

Keywords :

digital libraries; graph theory; network theory (graphs); string matching; ACM digital library; graph representation; name disambiguation problem; name variants; social network analysis; string similarity metrics; Computer errors; Computer science; Databases; Erbium; Information analysis; Information technology; Portals; Search problems; Social network services; Software libraries; Name Disambiguation; Social Networks; String Similarity Metrics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Convergence and Hybrid Information Technology, 2008. ICCIT '08. Third International Conference on

Conference_Location :

Busan

Print_ISBN :

978-0-7695-3407-7

Type :

conf

DOI :

10.1109/ICCIT.2008.210

Filename :

4682391

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3500055