DocumentCode
3747461
Title
Improving key concept extraction using word association measurement
Author
Phuoc Thi Hong Doan;Ngamnij Arch-int;Somjit Arch-int
Author_Institution
Department of Computer Science - Faculty of Science -Khon Kaen University, Khon Kaen - Thailand
fYear
2015
Firstpage
403
Lastpage
407
Abstract
Ontologies play a very important role in information exchange and sharing, and are typically constructed by human experts. However, this process is very costly in both time and effort. Given this, there is a need for automated ontology construction from various knowledge (such as text files). A key challenge of automated ontology learning from text is to extract key concepts, which are relevant to the domain, from the documents. Existing approaches typically require a large set of training data with prior domain-specific knowledge. However, it is not always possible to provide such knowledge and trained data sets. To overcome this issue, we present a method to obtain key concepts from unstructured texts by using the word association measure and statistical knowledge. To demonstrate the efficiency of our method in comparison with a state-of-the-art method, extensive experiments, which employed two real-world datasets, were performed. The obtained results indicate that our method achieves better accuracy than the state of the art method for 3% to 10% in case of not having domain-specific knowledge. The results are more efficient if there are many noun phrases (in data sets) whose number of words is large.
Keywords
"Ontologies","Medical services","Feature extraction","Terminology","Pragmatics","Weight measurement","Information technology"
Publisher
ieee
Conference_Titel
Information Technology and Electrical Engineering (ICITEE), 2015 7th International Conference on
Type
conf
DOI
10.1109/ICITEED.2015.7408980
Filename
7408980
Link To Document