• DocumentCode
    3747461
  • Title

    Improving key concept extraction using word association measurement

  • Author

    Phuoc Thi Hong Doan;Ngamnij Arch-int;Somjit Arch-int

  • Author_Institution
    Department of Computer Science - Faculty of Science -Khon Kaen University, Khon Kaen - Thailand
  • fYear
    2015
  • Firstpage
    403
  • Lastpage
    407
  • Abstract
    Ontologies play a very important role in information exchange and sharing, and are typically constructed by human experts. However, this process is very costly in both time and effort. Given this, there is a need for automated ontology construction from various knowledge (such as text files). A key challenge of automated ontology learning from text is to extract key concepts, which are relevant to the domain, from the documents. Existing approaches typically require a large set of training data with prior domain-specific knowledge. However, it is not always possible to provide such knowledge and trained data sets. To overcome this issue, we present a method to obtain key concepts from unstructured texts by using the word association measure and statistical knowledge. To demonstrate the efficiency of our method in comparison with a state-of-the-art method, extensive experiments, which employed two real-world datasets, were performed. The obtained results indicate that our method achieves better accuracy than the state of the art method for 3% to 10% in case of not having domain-specific knowledge. The results are more efficient if there are many noun phrases (in data sets) whose number of words is large.
  • Keywords
    "Ontologies","Medical services","Feature extraction","Terminology","Pragmatics","Weight measurement","Information technology"
  • Publisher
    ieee
  • Conference_Titel
    Information Technology and Electrical Engineering (ICITEE), 2015 7th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICITEED.2015.7408980
  • Filename
    7408980