DocumentCode
509147
Title
Automatic Knowledge Acquire System Oriented to Web Pages
Author
Junwu, Zhu ; Yi, Jiang ; Yingying, Xu
Author_Institution
Sch. of Inf. Eng., Yangzhou Univ., Yangzhou, China
Volume
2
fYear
2009
fDate
21-22 Nov. 2009
Firstpage
487
Lastpage
490
Abstract
The disordered way of the Web information organization has seriously hindered the knowledge sharing and interoperability, this paper presents a knowledge-oriented Web page automatic acquisition system (AKAS2WP). This system includes four core modules, and they are accessing of web pages, text extraction, the management and organizations of the concept and the attribute extraction of the concept. Accessing of Internet web pages is to download Web pages form certain site, saves and uses for web analytics, and text extraction filter module format Html document control symbols, to get a plain text file. Meanwhile, the management and organizations of the concept and the attribute extraction of the concept, respectively, obtains terminology of given certain domain, and get the terms´ description of the structure of property and structure of domain ontology. This system can be directly applied to the relevant field of automatic knowledge acquisition, so as to enhance the efficiency and accuracy of knowledge acquisition.
Keywords
Internet; hypermedia markup languages; knowledge acquisition; text analysis; AKAS2WP implementation; Internet Web page; Web analytics; Web information organization; automatic knowledge acquire system; knowledge oriented Web page automatic acquisition system; knowledge sharing; text extraction filter module Html document control symbols; Algorithm design and analysis; Data mining; HTML; Information analysis; Information technology; Internet; Knowledge acquisition; Knowledge engineering; Ontologies; Web pages; Knowledge Acquire; Ontology; Web Page;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Technology Application, 2009. IITA 2009. Third International Symposium on
Conference_Location
Nanchang
Print_ISBN
978-0-7695-3859-4
Type
conf
DOI
10.1109/IITA.2009.383
Filename
5369481
Link To Document