DocumentCode :
526632
Title :
XML-based Web information extraction system design and implementation
Author :
Jun, Ma ; Tihong, Li
Author_Institution :
Inf. Eng. Coll., Jiaozuo Univ., Jiaozuo, China
Volume :
8
fYear :
2010
fDate :
9-11 July 2010
Firstpage :
551
Lastpage :
554
Abstract :
Based on the research of the existing Web information extraction techniques, this paper proposes a XML-based Web information extraction system design. This design can extract the interested information points from HTML pages and express the extracted results by using structured XML with strong scalability. The system has certain versatility and flexibility, users can quickly customize for the Web information extraction wrapper to be used in different fields.
Keywords :
Web services; XML; HTML pages; Web information extraction system; XML; Data mining; Databases; HTML; Information technology; XML; Web information extraction; XML; XSLT; extraction rules;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5537-9
Type :
conf
DOI :
10.1109/ICCSIT.2010.5564746
Filename :
5564746
Link To Document :
بازگشت