• DocumentCode
    509147
  • Title

    Automatic Knowledge Acquire System Oriented to Web Pages

  • Author

    Junwu, Zhu ; Yi, Jiang ; Yingying, Xu

  • Author_Institution
    Sch. of Inf. Eng., Yangzhou Univ., Yangzhou, China
  • Volume
    2
  • fYear
    2009
  • fDate
    21-22 Nov. 2009
  • Firstpage
    487
  • Lastpage
    490
  • Abstract
    The disordered way of the Web information organization has seriously hindered the knowledge sharing and interoperability, this paper presents a knowledge-oriented Web page automatic acquisition system (AKAS2WP). This system includes four core modules, and they are accessing of web pages, text extraction, the management and organizations of the concept and the attribute extraction of the concept. Accessing of Internet web pages is to download Web pages form certain site, saves and uses for web analytics, and text extraction filter module format Html document control symbols, to get a plain text file. Meanwhile, the management and organizations of the concept and the attribute extraction of the concept, respectively, obtains terminology of given certain domain, and get the terms´ description of the structure of property and structure of domain ontology. This system can be directly applied to the relevant field of automatic knowledge acquisition, so as to enhance the efficiency and accuracy of knowledge acquisition.
  • Keywords
    Internet; hypermedia markup languages; knowledge acquisition; text analysis; AKAS2WP implementation; Internet Web page; Web analytics; Web information organization; automatic knowledge acquire system; knowledge oriented Web page automatic acquisition system; knowledge sharing; text extraction filter module Html document control symbols; Algorithm design and analysis; Data mining; HTML; Information analysis; Information technology; Internet; Knowledge acquisition; Knowledge engineering; Ontologies; Web pages; Knowledge Acquire; Ontology; Web Page;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Technology Application, 2009. IITA 2009. Third International Symposium on
  • Conference_Location
    Nanchang
  • Print_ISBN
    978-0-7695-3859-4
  • Type

    conf

  • DOI
    10.1109/IITA.2009.383
  • Filename
    5369481