Title :
The Content Extraction Method of Webpage Information Based on Knowledge Base
Author :
Chen, Guowei ; Zhang, Pengzhou
Author_Institution :
MITI Lab., Commun. Univ. of China, Beijing, China
Abstract :
Web content extraction is actually the process of transforming web unstructured information into structured information. Knowledge base has the advantages of ordering information and knowledge, also be used conveniently. So it´s convenient to retrieve information and knowledge, and it makes base for effective use. Knowledge base will speed up the knowledge and the flow of information and make for knowledge sharing and communication. This paper puts forward a web information extraction method which is based on the knowledge base. Experiment results show that the method has greatly increased efficiency and accuracy of the web information extraction.
Keywords :
Internet; information retrieval; Web content extraction; Web information extraction; Webpage information; communication; knowledge base; knowledge sharing; unstructured information; Accuracy; Data mining; HTML; Internet; Knowledge based systems; Web pages; KA; PA; Semistructured Data; information extraction; knowledge base;
Conference_Titel :
Computational Sciences and Optimization (CSO), 2012 Fifth International Joint Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4673-1365-0
DOI :
10.1109/CSO.2012.142