• DocumentCode
    1825621
  • Title

    An effective approach to entity resolution problem using quasi-clique and its application to digital libraries

  • Author

    On, B.-W. ; Elmacioglu, E. ; Lee, Daewoo

  • Author_Institution
    Pennsylvania State Univ., University Park, PA
  • fYear
    2006
  • fDate
    11-15 June 2006
  • Firstpage
    51
  • Lastpage
    52
  • Abstract
    We study how to resolve entities that contain a group of related elements in them (e.g., an author entity with a list of citations or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. By exploiting contextual information mined from the group of elements per entity in addition to syntactic similarity, we show that our approach, Quasi-Clique, improves precision and recall unto 91% when used together with a variety of existing entity resolution solutions, but never worsens them
  • Keywords
    data mining; digital libraries; entity-relationship modelling; QuasiClique; digital library; entity resolution problem; grouped-entities; Collaboration; Data structures; Degradation; Entropy; Erbium; Information retrieval; Information systems; Joining processes; Partitioning algorithms; Software libraries; entity resolution; graph partition; name disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Libraries, 2006. JCDL '06. Proceedings of the 6th ACM/IEEE-CS Joint Conference on
  • Conference_Location
    Chapel Hill, NC
  • Print_ISBN
    1-59593-354-9
  • Type

    conf

  • DOI
    10.1145/1141753.1141761
  • Filename
    4119096