• DocumentCode
    2427435
  • Title

    Automated Extraction of Conceptual Knowledge from a Chinese Machine-Readable Dictionary

  • Author

    Hu, Yi ; Lu, Ruzhan ; Chen, Yuquan ; Zhao, Jinglei

  • Author_Institution
    Shanghai Jiao Tong Univ., Shanghai
  • Volume
    4
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    578
  • Lastpage
    582
  • Abstract
    In this paper, we exploit a Chinese machine-readable dictionary to extract the conceptual knowledge, i.e. the <attribute, value> pairs involving in hypernym, (artificiality) material, (artificiality) function and (medicine) usage from the corresponding definitions of nominal entries. Our method focuses on (1) constructing the extraction patterns and (2) the statistical decision for applying these patterns. Therefore our work is designed to be a new three-step procedure. Firstly, annotate the definitions of a number of nominal entries that are used as training samples of these four attributes and contextual linguistic features; secondly, design different patterns for extracting such conceptual knowledge, and learn the applicability of the patterns by a Maximum Entropy (ME) classifier to decide whether a pattern can be used in current context or not; at last, apply these patterns to the remaining nominal entries of the dictionary, and we achieve relatively satisfying results.
  • Keywords
    classification; dictionaries; information retrieval; knowledge acquisition; maximum entropy methods; natural language processing; Chinese machine-readable dictionary; automated conceptual knowledge extraction; information retrieval; maximum entropy classifier; Buildings; Charge coupled devices; Computer science; Data mining; Databases; Dictionaries; Entropy; Information analysis; Knowledge engineering; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.199
  • Filename
    4406453