DocumentCode :
1827091
Title :
Search engine driven author disambiguation
Author :
Tan, Yee Fan ; Kan, Min-Yen ; Lee, Dongwon
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore
fYear :
2006
fDate :
38869
Firstpage :
314
Lastpage :
315
Abstract :
In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically-crafted Web searches. A key observation is that pages from rare Web sites are stronger source of evidence than pages from common Web sites, which we model as inverse host frequency (IHF). Our system is able to achieve an average accuracy of 0.836
Keywords :
Internet; bibliographic systems; citation analysis; digital libraries; search engines; Web sites; automatically-crafted Web searches; citation analysis; inverse host frequency; scholarly digital libraries; search engine driven author disambiguation; Computer science; Drives; Frequency; Information retrieval; Information systems; Inverse problems; Search engines; Software libraries; Uniform resource locators; Web search; IHF; author disambiguation; entity resolution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Libraries, 2006. JCDL '06. Proceedings of the 6th ACM/IEEE-CS Joint Conference on
Conference_Location :
Chapel Hill, NC
Print_ISBN :
1-59593-354-9
Type :
conf
DOI :
10.1145/1141753.1141826
Filename :
4119147
Link To Document :
بازگشت