• DocumentCode
    464197
  • Title

    Using Web Directories for Similarity Measurement in Personal Name Disambiguation

  • Author

    Vu, Quang Minh ; Masada, Tomonari ; Takasu, Atsuhiro ; Adachi, Jun

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Tokyo Univ., Tokyo
  • Volume
    1
  • fYear
    2007
  • fDate
    21-23 May 2007
  • Firstpage
    379
  • Lastpage
    384
  • Abstract
    In this paper, we target on the problem of personal name disambiguation in search results returned by personal name queries. Usually, a personal name refers to several people. Therefore, when a search engine returns a set of documents containing that name, they are often relevant to several individuals with the same namesake. Automatic differentiation of people in the resulting documents may help users to search for the person of interest easier. We propose a method that uses Web directories to improve the similarity measurement in personal name disambiguation. We carried out experiments on real Web documents in which we compared our method with the vector space model method and the named entity recognition method. The results show that our method has advantages over these previous methods.
  • Keywords
    Internet; document handling; query processing; search engines; Web directory; document similarity measurement; personal name disambiguation; personal name query; search engine; Data mining; Databases; Informatics; Information science; Natural language processing; Pattern matching; Search engines; Social network services; Web sites; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
  • Conference_Location
    Niagara Falls, Ont.
  • Print_ISBN
    978-0-7695-2847-2
  • Type

    conf

  • DOI
    10.1109/AINAW.2007.367
  • Filename
    4221089