DocumentCode :
658349
Title :
OnPerDis: Ontology-Based Personal Name Disambiguation on the Web
Author :
Zhao Lu ; Zhixian Yan ; Liang He
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
Volume :
1
fYear :
2013
fDate :
17-20 Nov. 2013
Firstpage :
185
Lastpage :
192
Abstract :
With the growth of web documents, personal name ambiguity has become more common and degrades web search performance. Identifying the correct personal entity from part of a document or from the whole document remains a very challenging problem, especially for Chinese websites. In this paper, we propose a novel ontology-based approach for personal name disambiguation, named "OnPerDis". The approach has two main steps. First, we construct a person ontology (PO) with rich conceptual modeling and a large set of supporting instances. Second, for a given personal name on the web, we create a temporary instance, extract its features from the web documents, and calculate the similarity between this temporary instance and the instances in the PO. The instance with the highest similarity score is chosen as the referent of the personal name. Our extensive evaluations on two rich real-life datasets (CIPS-SIGHAN 2012 NERD and Chinese web documents) show OnPerDis's efficacy for personal name disambiguation on the Web.
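The second step described in the abstract (build a temporary instance, score it against every instance in the PO, keep the best match) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which features or which similarity measure OnPerDis uses, so Jaccard overlap on plain string feature sets is a stand-in, and the names `jaccard`, `disambiguate`, and the toy ontology entries are hypothetical.

```python
from typing import Dict, Set, Tuple


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two feature sets; 0.0 when both are empty.

    Assumption: a stand-in measure, not the similarity used in the paper.
    """
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def disambiguate(
    temp_features: Set[str], ontology: Dict[str, Set[str]]
) -> Tuple[str, float]:
    """Return the id and score of the ontology instance that is most
    similar to the temporary instance."""
    if not ontology:
        raise ValueError("person ontology has no instances")
    scores = {pid: jaccard(temp_features, feats) for pid, feats in ontology.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical toy ontology: two people who share one personal name.
person_ontology = {
    "li_ming_professor": {"university", "computer science", "shanghai", "professor"},
    "li_ming_athlete": {"basketball", "beijing", "league", "coach"},
}

# Features extracted from a web document that mentions the name (made up).
temp_instance = {"shanghai", "professor", "lecture", "university"}

best_id, best_score = disambiguate(temp_instance, person_ontology)
print(best_id, round(best_score, 2))
```

In this toy case the temporary instance shares three of five distinct features with the professor entry and none with the athlete entry, so the professor entry is selected. A real system would presumably weight features and apply a threshold for names that match no instance in the PO; neither is specified in the abstract.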
Keywords :
Web sites; document handling; feature extraction; information retrieval; natural language processing; ontologies (artificial intelligence); pattern matching; CIPS-SIGHAN 2012 NERD; Chinese Web documents; Chinese Web sites; OnPerDis; PO; Web search; conceptual modeling; feature extraction; ontology-based personal name disambiguation; person ontology; personal entity Identification; personal name ambiguity; real-life datasets; similarity score; temporary instance; Data mining; Educational institutions; Encyclopedias; Feature extraction; Ontologies; Sociology; Statistics; Conceptual modeling; Instance matching; Ontology population; Personal name disambiguation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
Type :
conf
DOI :
10.1109/WI-IAT.2013.28
Filename :
6690013