• DocumentCode
    1737157
  • Title

    Approach for Name Ambiguity Problem Using a Multiple-Layer Clustering

  • Author

    Jian, Wenrong ; Wang, Anbao ; Wu, Cuihong ; Chen, Jian ; Yan, Jihong

  • Author_Institution
    Sch. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
  • Volume
    4
  • fYear
    2009
  • Firstpage
    874
  • Lastpage
    878
  • Abstract
    Name ambiguity refers to the problem of attributing a publication to a proper author. This is a common issue in digital library. It is a difficult problem as the same author´s name may be written in different ways and different authors may share the same name. In this paper, we examine a multiple-layer clustering approach which is based on a limited amount of associated information with each publication. It combines the Package-Merge algorithm, pattern-matching extraction methods, as well as a fuzzy logic rule based concept. This experimental study uses the DBLP collection as a case study, and the three attributes used are email addresses, the co-authorship relationship and paper title similarity. Our experiments show that this approach can distinguish authors and classify papers on the test dataset more accurately than the previous studies.
  • Keywords
    bibliographic systems; fuzzy set theory; pattern clustering; search engines; Package-Merge algorithm; coauthorship relationship; digital library; fuzzy logic rule; multiple-layer clustering; name ambiguity problem; pattern-matching extraction methods; Geophysical measurement techniques; Ground penetrating radar; Clustering; Fuzzy Logic; Name Ambiguity; Package-Merge;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Science and Engineering, 2009. CSE '09. International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4244-5334-4
  • Electronic_ISBN
    978-0-7695-3823-5
  • Type

    conf

  • DOI
    10.1109/CSE.2009.110
  • Filename
    5283137