DocumentCode
464197
Title
Using Web Directories for Similarity Measurement in Personal Name Disambiguation
Author
Vu, Quang Minh ; Masada, Tomonari ; Takasu, Atsuhiro ; Adachi, Jun
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Tokyo Univ., Tokyo
Volume
1
fYear
2007
fDate
21-23 May 2007
Firstpage
379
Lastpage
384
Abstract
In this paper, we address the problem of personal name disambiguation in search results returned for personal name queries. A personal name is often shared by several people. Therefore, when a search engine returns a set of documents containing that name, the documents are often relevant to several different individuals who share it. Automatically differentiating the people in the resulting documents may help users find the person of interest more easily. We propose a method that uses Web directories to improve the similarity measurement in personal name disambiguation. We carried out experiments on real Web documents in which we compared our method with the vector space model method and the named entity recognition method. The results show that our method has advantages over these previous methods.
Keywords
Internet; document handling; query processing; search engines; Web directory; document similarity measurement; personal name disambiguation; personal name query; search engine; Data mining; Databases; Informatics; Information science; Natural language processing; Pattern matching; Search engines; Social network services; Web sites; World Wide Web;
fLanguage
English
Publisher
IEEE
Conference_Titel
Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on
Conference_Location
Niagara Falls, Ont.
Print_ISBN
978-0-7695-2847-2
Type
conf
DOI
10.1109/AINAW.2007.367
Filename
4221089