Title :
Name disambiguation from link data in a collaboration graph
Author :
Baichuan Zhang ; Saha, Tapan K. ; Al Hasan, Mohammad
Author_Institution :
Dept. of Comput. & Inf. Sci., Indiana Univ.-Purdue Univ. Indianapolis, Indianapolis, IN, USA
Abstract :
The entity disambiguation task partitions the records belonging to multiple persons with the objective that each decomposed partition is composed of records of a unique person. Existing solutions to this task use either biographical attributes, or auxiliary features that are collected from external sources, such as Wikipedia. However, for many scenarios, such auxiliary features are not available, or they are costly to obtain. Besides, the attempt of collecting biographical or external data sustains the risk of privacy violation. In this work, we propose a method for solving entity disambiguation task from link information obtained from a collaboration network. Our method is non-intrusive of privacy as it uses only the time-stamped graph topology of an anonymized network. Experimental results on two real-life academic collaboration networks show that the proposed method has satisfactory performance.
Keywords :
data privacy; graph theory; groupware; Wikipedia; academic collaboration networks; anonymized network; auxiliary features; biographical attributes; collaboration graph; collaboration network; entity disambiguation task; link data; link information; name disambiguation; privacy violation; time-stamped graph topology; Collaboration; Conferences; Data privacy; Equations; Mathematical model; Social network services; Vectors;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on
Conference_Location :
Beijing
DOI :
10.1109/ASONAM.2014.6921563