Title :
Integrating ontological and linguistic knowledge for conceptual information extraction
Author :
Basili, Roberto ; Vindigni, Michele ; Zanzotto, Fabio Massimo
Author_Institution :
Dept. of Comput. Sci., Rome Univ., Italy
Abstract :
Text understanding makes strong assumptions about the conceptualisation of the underlying knowledge domain. This mediates between the accomplishment of the specific task at the one hand and the knowledge expressed in the target text fragments at the other. However, building domain conceptualisations from scratch is a very complex and time-consuming task. Traditionally, the reuse of available domain resources, although not constituting always the best, has been applied as an accurate and cost effective solution. Here, we investigate the possibility of exploiting sources of domain knowledge (e.g. a subject reference system) to build a linguistically motivated domain concept hierarchy. The limitation connected with the use of domain taxonomies as ontological resources will be firstly discussed in the specific light of IE, i.e. for supporting linguistic inference. We then define a method for integrating the taxonomical domain knowledge and a general-purpose lexical knowledge base, like WordNet. A case study, i.e. the integration of the MeSH, Medical Subject Headings, and WordNet, will be then presented as a proof of the effectiveness and accuracy of the overall approach.
Keywords :
dictionaries; information retrieval; knowledge based systems; linguistics; text analysis; vocabulary; WordNet; conceptual information extraction; information reuse; knowledge integration; lexical knowledge base; linguistic knowledge; ontological knowledge; taxonomical domain knowledge; text understanding; Computer science; Costs; Data mining; Databases; Intelligent structures; Investments; Ontologies; Taxonomy; Text categorization;
Conference_Titel :
Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1932-6
DOI :
10.1109/WI.2003.1241190