DocumentCode
3078539
Title
A novel approach for entity linkage
Author
Stoermer, Heiko ; Bouquet, Paolo
Author_Institution
Dept. of Inf. Sci. & Eng., Univ. of Trento, Trento, Italy
fYear
2009
fDate
10-12 Aug. 2009
Firstpage
151
Lastpage
156
Abstract
The problem of data linkage in the semantic Web can be divided into two lines of action: schema and ontology matching/mapping, which allows us to draw conclusions about sets of individuals through concept relations, and entity-level linkage, where more information can be reached from distributed sources because the information is about the same entity. While the area of schema and ontology matching has traditionally received much attention, today the semantic Web looks very much like a collection of "information islands" that are very poorly integrated with each other, especially at the level of individuals; and when some of these islands are linked, this is often the result of a lot of hard and time-consuming manual work. The general problem we are working on is to provide a structured approach to improving data linkage at the level of individuals in the Web of data. As a specific contribution, in this article we describe a novel algorithm for entity linkage, called name feature match, based on a recent empirical investigation of how humans describe individuals (or entities). We show in a first experimental evaluation that such an approach, which takes into account the cognitive point of view of entity representation by humans, can provide an improvement over other relevant approaches.
Keywords
knowledge representation languages; ontologies (artificial intelligence); pattern matching; semantic Web; OWL; Web-of-data; cognitive point; concept relation; data linkage; distributed sources; entity representation; entity-level linkage algorithm; information island collection; name feature matching algorithm; ontology matching/mapping; schema matching/mapping; semantic Web; structured approach; Couplings; Face recognition; Humans; Information science; Information systems; Large-scale systems; Ontologies; Resource description framework; Semantic Web; Vocabulary; Entity Name System; Entity-centric Information Integration; Identifier Reuse; Unique Identifiers;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4244-4114-3
Electronic_ISBN
978-1-4244-4116-7
Type
conf
DOI
10.1109/IRI.2009.5211542
Filename
5211542