• DocumentCode
    1843629
  • Title
    Mining Concepts from Wikipedia for Ontology Construction
  • Author
    Cui, Gaoying ; Lu, Qin ; Li, Wenjie ; Chen, Yirong
  • Volume
    3
  • fYear
    2009
  • fDate
    15-18 Sept. 2009
  • Firstpage
    287
  • Lastpage
    290
  • Abstract
    An ontology is a structured knowledge base of concepts organized by the relations among them. However, concepts are usually mixed with their instances in the corpora used for knowledge extraction. Concepts and their corresponding instances share similar features and are difficult to distinguish. In this paper, a novel approach is proposed to comprehensively obtain concepts with the help of definition sentences and Category Labels in Wikipedia pages. N-gram statistics and other NLP knowledge are used to help extract appropriate concepts. The proposed method identified nearly 50,000 concepts from about 700,000 Wiki pages. With a precision of 78.5%, it is an effective approach for mining concepts from Wikipedia for ontology construction.
  • Keywords
    Collaboration; Conferences; Information science; Intelligent agent; Intelligent structures; Ontologies; Search engines; Statistics; Taxonomy; Wikipedia; Concept; Ontology Construction
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Milan, Italy
  • Print_ISBN
    978-0-7695-3801-3
  • Electronic_ISBN
    978-1-4244-5331-3
  • Type
    conf
  • DOI
    10.1109/WI-IAT.2009.284
  • Filename
    5285031