Title :
Automatic stuff relation extraction from scientific documents for natural product ontology construction
Author :
Lertsakunsomboon, S. ; Pechsiri, C.
Author_Institution :
Fac. of Inf. Technol., Dhurakij Pundit Univ., Bangkok, Thailand
Abstract :
To extract Part-Whole relations, especially the stuff relation, from unstructured textual data is the challenging work This paper presents how to automatically extract the stuff relation from technical documents on the Web for supporting chemical industries. The research extracts the stuff relation without applying POS (Part-of-Speech) annotation. There are three problems of extracting the stuff relation: a) the identification of stuff relation without POS annotation problem, b) the chemical-formula-embedded name entity determination problem and c) the genus-species name entity determination problem. We propose using Naive Bayes to learn the stuff relation. The results from our proposed methodology are 87% precision and 61% recall.
Keywords :
Bayes methods; chemical industry; ontologies (artificial intelligence); production engineering computing; scientific information systems; text analysis; Naive Bayes; automatic stuff relation extraction; chemical industries; chemical-formula-embedded name entity determination problem; genus-species name entity determination problem; natural product ontology construction; part-whole relations; scientific documents; stuff relation identification; technical documents; unstructured textual data; chemical name entity; scientific name entity; stuff relation;
Conference_Titel :
Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-0894-6