DocumentCode :
640596
Title :
Representing Texts as Contextualized Entity-Centric Linked Data Graphs
Author :
Freitas, Adelaide ; O'Riain, Sean ; Curry, Edward ; Da Silva, Joao Carlos P. ; Carvalho, Danilo S.
Author_Institution :
Digital Enterprise Research Inst. (DERI), Nat. Univ. of Ireland, Galway, Galway, Ireland
fYear :
2013
fDate :
26-30 Aug. 2013
Firstpage :
133
Lastpage :
137
Abstract :
Integrating even a small fraction of the information present in the Web of Documents into the Linked Data Web can provide a significant shift in the amount of information available to data consumers. However, information extracted from text does not easily fit into the usually highly normalized structure of ontology-based datasets. While the representation of structured data assumes a high level of regularity and relatively simple, consistent conceptual models, the representation of information extracted from text needs to take into account large terminological variation, complex contextual/dependency patterns, and fuzzy or conflicting semantics. This work focuses on bridging the gap between structured and unstructured data, proposing the representation of text as structured discourse graphs (SDGs), targeting an RDF representation of unstructured data. The representation focuses on a semantic best-effort information extraction scenario, where information is extracted from text under a pay-as-you-go data quality perspective, trading terminological normalization for domain independence, context capture, wider representation scope, and maximization of textual information capture.
Keywords :
Internet; data structures; information retrieval; ontologies (artificial intelligence); text analysis; RDF representation; SDG; complex contextual patterns; contextualized entity-centric linked data graphs; large terminological variation; linked data Web; ontology-based datasets; pay-as-you-go data quality perspective; semantic best-effort information extraction scenario; structured data; structured discourse graphs; terminological normalization; text representation; unstructured data; Context; Context modeling; Data mining; Data models; Ontologies; Resource description framework; Semantics; Discourse Graphs; Discourse Representation; Linked Data; Semantic Web;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2013 24th International Workshop on
Conference_Location :
Los Alamitos, CA
ISSN :
1529-4188
Print_ISBN :
978-0-7695-5070-1
Type :
conf
DOI :
10.1109/DEXA.2013.21
Filename :
6621360