Title :
Using Web Directories for Similarity Measurement in Personal Name Disambiguation
Author :
Vu, Quang Minh ; Masada, Tomonari ; Takasu, Atsuhiro ; Adachi, Jun
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Tokyo Univ., Tokyo
Abstract :
In this paper, we address the problem of personal name disambiguation in the search results returned for personal name queries. A personal name is usually shared by several people. Therefore, when a search engine returns a set of documents containing that name, the documents often refer to several different individuals who share it. Automatically distinguishing the people mentioned in the returned documents can help users find the person of interest more easily. We propose a method that uses Web directories to improve the similarity measurement in personal name disambiguation. We carried out experiments on real Web documents in which we compared our method with the vector space model method and the named entity recognition method. The results show that our method has advantages over these previous methods.
Keywords :
Internet; document handling; query processing; search engines; Web directory; document similarity measurement; personal name disambiguation; personal name query; search engine; Data mining; Databases; Informatics; Information science; Natural language processing; Pattern matching; Search engines; Social network services; Web sites; World Wide Web;
Conference_Title :
Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
Conference_Location :
Niagara Falls, Ont.
Print_ISBN :
978-0-7695-2847-2
DOI :
10.1109/AINAW.2007.367