DocumentCode
1737157
Title
Approach for Name Ambiguity Problem Using a Multiple-Layer Clustering
Author
Jian, Wenrong ; Wang, Anbao ; Wu, Cuihong ; Chen, Jian ; Yan, Jihong
Author_Institution
Sch. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Volume
4
fYear
2009
Firstpage
874
Lastpage
878
Abstract
Name ambiguity refers to the problem of attributing a publication to the correct author. This is a common issue in digital libraries. It is a difficult problem because the same author's name may be written in different ways and different authors may share the same name. In this paper, we examine a multiple-layer clustering approach based on the limited amount of information associated with each publication. It combines the Package-Merge algorithm, pattern-matching extraction methods, and a fuzzy-logic rule-based concept. The experimental study uses the DBLP collection as a case study, and the three attributes used are email addresses, co-authorship relationships, and paper title similarity. Our experiments show that this approach distinguishes authors and classifies papers on the test dataset more accurately than previous studies.
Keywords
bibliographic systems; fuzzy set theory; pattern clustering; search engines; Package-Merge algorithm; coauthorship relationship; digital library; fuzzy logic rule; multiple-layer clustering; name ambiguity problem; pattern-matching extraction methods; Geophysical measurement techniques; Ground penetrating radar; Clustering; Fuzzy Logic; Name Ambiguity; Package-Merge;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Science and Engineering, 2009. CSE '09. International Conference on
Conference_Location
Vancouver, BC
Print_ISBN
978-1-4244-5334-4
Electronic_ISBN
978-0-7695-3823-5
Type
conf
DOI
10.1109/CSE.2009.110
Filename
5283137