• DocumentCode
    607252
  • Title

    Automatic stuff relation extraction from scientific documents for natural product ontology construction

  • Author

    Lertsakunsomboon, S. ; Pechsiri, C.

  • Author_Institution
    Fac. of Inf. Technol., Dhurakij Pundit Univ., Bangkok, Thailand
  • fYear
    2012
  • fDate
    3-5 Dec. 2012
  • Firstpage
    273
  • Lastpage
    277
  • Abstract
    To extract Part-Whole relations, especially the stuff relation, from unstructured textual data is the challenging work This paper presents how to automatically extract the stuff relation from technical documents on the Web for supporting chemical industries. The research extracts the stuff relation without applying POS (Part-of-Speech) annotation. There are three problems of extracting the stuff relation: a) the identification of stuff relation without POS annotation problem, b) the chemical-formula-embedded name entity determination problem and c) the genus-species name entity determination problem. We propose using Naive Bayes to learn the stuff relation. The results from our proposed methodology are 87% precision and 61% recall.
  • Keywords
    Bayes methods; chemical industry; ontologies (artificial intelligence); production engineering computing; scientific information systems; text analysis; Naive Bayes; automatic stuff relation extraction; chemical industries; chemical-formula-embedded name entity determination problem; genus-species name entity determination problem; natural product ontology construction; part-whole relations; scientific documents; stuff relation identification; technical documents; unstructured textual data; chemical name entity; scientific name entity; stuff relation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-0894-6
  • Type

    conf

  • Filename
    6530341