Title :
Approach for Name Ambiguity Problem Using a Multiple-Layer Clustering
Author :
Jian, Wenrong ; Wang, Anbao ; Wu, Cuihong ; Chen, Jian ; Yan, Jihong
Author_Institution :
Sch. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Abstract :
Name ambiguity refers to the problem of attributing a publication to a proper author. This is a common issue in digital library. It is a difficult problem as the same author´s name may be written in different ways and different authors may share the same name. In this paper, we examine a multiple-layer clustering approach which is based on a limited amount of associated information with each publication. It combines the Package-Merge algorithm, pattern-matching extraction methods, as well as a fuzzy logic rule based concept. This experimental study uses the DBLP collection as a case study, and the three attributes used are email addresses, the co-authorship relationship and paper title similarity. Our experiments show that this approach can distinguish authors and classify papers on the test dataset more accurately than the previous studies.
Keywords :
bibliographic systems; fuzzy set theory; pattern clustering; search engines; Package-Merge algorithm; coauthorship relationship; digital library; fuzzy logic rule; multiple-layer clustering; name ambiguity problem; pattern-matching extraction methods; Geophysical measurement techniques; Ground penetrating radar; Clustering; Fuzzy Logic; Name Ambiguity; Package-Merge;
Conference_Titel :
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-5334-4
Electronic_ISBN :
978-0-7695-3823-5
DOI :
10.1109/CSE.2009.110