DocumentCode :
2865845
Title :
Mining ontological knowledge from domain-specific text documents
Author :
Jiang, Xing ; Tan, Ah-Hwee
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
Traditional text mining systems employ shallow parsing techniques and focus on concept extraction and taxonomic relation extraction. This paper presents a novel system called CRCTOL for mining rich semantic knowledge in the form of ontology from domain-specific text documents. By using a full text parsing technique and incorporating both statistical and lexico-syntactic methods, the knowledge extracted by our system is more concise and contains a richer semantics compared with alternative systems. We conduct a case study wherein CRCTOL extracts ontological knowledge, specifically key concepts and semantic relations, from a terrorism domain text collection. Quantitative evaluation, by comparing with a state-of-the-art ontology learning system known as text-to-onto, has shown that CRCTOL produces much better precision and recall for both concept and relation extraction, especially from sentences with complex structures.
Keywords :
data mining; grammars; learning systems; ontologies (artificial intelligence); statistical analysis; text analysis; concept extraction; concept relation concept tuple; domain-specific text document; full text parsing; lexico-syntactic method; ontological knowledge mining; ontology learning; relation extraction; statistical method; text mining; Data mining; Learning systems; Libraries; Natural language processing; Ontologies; Semantic Web; Software agents; Spine; Terrorism; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.97
Filename :
1565752
Link To Document :
بازگشت