DocumentCode :
3229142
Title :
Name Disambiguation in Person Information Mining
Author :
Wei, Yu-Chuan ; Lin, Ming-Shun ; Chen, Hsin-Hsi
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
378
Lastpage :
381
Abstract :
This paper considers five features, personal titles, community chains, terms, temporal expressions, and hostnames for personal name disambiguation. In 9 test data sets covering 3 ambiguous personal names, we address the issues of awareness degree of an entity, the source of materials and Web pages in different areas. Two approaches, single-clusterer and cascaded multiple-clusterer, are proposed. In the experiments, the proposed features are quite useful; the multiple-clusterer approach is better than the single-clusterer approach; and expanding community chains using the Web has positive effects on personal name disambiguation
Keywords :
Internet; data mining; information analysis; pattern clustering; Web page; cascaded multiple-clusterer approach; community chain; person information mining; personal name disambiguation; single-clusterer approach; Computer science; Data mining; Materials testing; Social network services; Statistics; Web pages; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2747-7
Type :
conf
DOI :
10.1109/WI.2006.121
Filename :
4061399
Link To Document :
بازگشت