• DocumentCode
    2865845
  • Title

    Mining ontological knowledge from domain-specific text documents

  • Author

    Jiang, Xing ; Tan, Ah-Hwee

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    Traditional text mining systems employ shallow parsing techniques and focus on concept extraction and taxonomic relation extraction. This paper presents a novel system called CRCTOL for mining rich semantic knowledge in the form of ontology from domain-specific text documents. By using a full text parsing technique and incorporating both statistical and lexico-syntactic methods, the knowledge extracted by our system is more concise and contains a richer semantics compared with alternative systems. We conduct a case study wherein CRCTOL extracts ontological knowledge, specifically key concepts and semantic relations, from a terrorism domain text collection. Quantitative evaluation, by comparing with a state-of-the-art ontology learning system known as text-to-onto, has shown that CRCTOL produces much better precision and recall for both concept and relation extraction, especially from sentences with complex structures.
  • Keywords
    data mining; grammars; learning systems; ontologies (artificial intelligence); statistical analysis; text analysis; concept extraction; concept relation concept tuple; domain-specific text document; full text parsing; lexico-syntactic method; ontological knowledge mining; ontology learning; relation extraction; statistical method; text mining; Data mining; Learning systems; Libraries; Natural language processing; Ontologies; Semantic Web; Software agents; Spine; Terrorism; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.97
  • Filename
    1565752