• DocumentCode
    2758015
  • Title

    Automatic Extending HowNet's Attribute Lexicon on the Web

  • Author

    Zhao, Jinglei ; Liu, Hui ; Lu, Ruzhan

  • Author_Institution
    Dept. of Comput. Sci., Shanghai Jiao Tong Univ., Shanghai
  • fYear
    2007
  • fDate
    16-18 Dec. 2007
  • Firstpage
    315
  • Lastpage
    320
  • Abstract
    It is well known that lexical knowledge sources such as WordNet and HowNet are very important to natural language processing applications in artificial intelligence. This paper proposes a new Web-mining algorithm to automatically extend HowNet's attribute set. Seeded by the original attribute set in HowNet, the algorithm uses a conjunctive lexical pattern plus a validation mechanism called PES (position exchanging search) to extend the lexicon iteratively on the Web. In addition, a Web-based attribute classifier is constructed that acts as a filter to control the level of false positives during each iteration. The algorithm is evaluated using both standard human judgements and a HowNet-based evaluation. Some experimental results on the performance of the method are provided.
  • Keywords
    Internet; artificial intelligence; data mining; iterative methods; natural language processing; pattern classification; HowNet; HowNet's attribute lexicon; Web-based attribute classifier; Web-mining algorithm; WordNet; World Wide Web; artificial intelligence; conjunctive lexical pattern; lexical knowledge sources; natural language processing applications; position exchanging search; Application software; Computer science; Data mining; Filters; Internet; Iterative algorithms; Knowledge acquisition; Natural language processing; Signal processing; Web pages; Attribute Learning; HowNet;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal-Image Technologies and Internet-Based System, 2007. SITIS '07. Third International IEEE Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3122-9
  • Type

    conf

  • DOI
    10.1109/SITIS.2007.28
  • Filename
    4618791