DocumentCode
640594
Title
An Approach for Populating and Enriching Ontology-Based Repositories
Author
Canito, Alda ; Maio, Paulo ; Silva, Nuno
Author_Institution
Sch. of Eng., Polytech. of Porto, Porto, Portugal
fYear
2013
fDate
26-30 Aug. 2013
Firstpage
123
Lastpage
127
Abstract
Publicly available text-based documents (e.g. news, meeting transcripts) are a very important source of knowledge, especially for organizations. These documents mention domain entities such as persons, places, professional positions, decisions and actions. Querying these documents (instead of browsing, searching and finding) is a very relevant task for any person in general, and particularly for professionals dealing with knowledge-intensive tasks. Querying text-based documents' data, however, is not supported by common technology. For that, such documents' content has to be explicitly and formally captured as facts in a knowledge base. Making use of automatic NLP processes for capturing such facts is a common approach, but their relatively low precision and recall give rise to data quality problems. Furthermore, facts existing in the documents are often insufficient to answer complex queries, hence the need to enrich the captured facts with facts from third-party repositories (e.g. public LOD). This paper describes the adopted process to clean, populate and enrich a knowledge base repository that is further exploited to answer complex queries. This process is triggered by a previous NLP parsing process and conducted by the (rich) ontology describing that repository.
Keywords
knowledge based systems; natural language processing; ontologies (artificial intelligence); organisational aspects; query processing; text analysis; automatic NLP parsing process; data quality problems; document content; domain entities; knowledge base repository; knowledge source; ontology-based repositories; organizations; precision value; public LOD; publically available text-based documents; query answering; recall value; text-based document data query; third-party repositories; Buildings; Data mining; Knowledge based systems; Merging; OWL; Ontologies; Semantics; Ontology Data Enrichment; Ontology Population; Ontology-based Data Cleaning;
fLanguage
English
Publisher
IEEE
Conference_Titel
Database and Expert Systems Applications (DEXA), 2013 24th International Workshop on
Conference_Location
Los Alamitos, CA
ISSN
1529-4188
Print_ISBN
978-0-7695-5070-1
Type
conf
DOI
10.1109/DEXA.2013.19
Filename
6621358