Title :
Method of Web Information Extraction Based on Decision Tree
Author_Institution :
Sch. of Inf. & Electron. Eng., Zhejiang Univ. of Sci. & Technol., Hangzhou, China
Abstract :
Because data on the Web is constantly updated, this paper studies decision tree technology and how to apply it to Web information extraction. From the datasets obtained by information extraction, a decision tree for the agricultural products market is constructed with the C4.5/C5.0 algorithm; the tree is updated as new data arrives and is then used to generate understandable rules. The experiment shows that Web information extraction based on decision trees is feasible.
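The abstract names C4.5 but does not show how a split is chosen. As a minimal sketch, assuming the standard C4.5 criterion (gain ratio, i.e. information gain divided by split information), the attribute selection step could look like the following. The record fields `season`, `supply`, and `trend` and the four sample rows are hypothetical illustrations, not data from the paper, and the sketch omits the rest of C4.5 (continuous attributes, missing values, pruning).

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(values).values())


def gain_ratio(rows, attr, target):
    """Gain ratio of splitting `rows` on `attr` with respect to `target`."""
    total = len(rows)
    # Group the target labels by the value of the candidate attribute.
    groups = {}
    for row in rows:
        groups.setdefault(row[attr], []).append(row[target])
    # Information gain = entropy before the split minus weighted entropy after.
    before = entropy([row[target] for row in rows])
    after = sum(len(g) / total * entropy(g) for g in groups.values())
    gain = before - after
    # Split information penalizes attributes with many distinct values.
    split_info = entropy([row[attr] for row in rows])
    return gain / split_info if split_info > 0 else 0.0


def best_attribute(rows, attrs, target):
    """Pick the attribute with the highest gain ratio."""
    return max(attrs, key=lambda a: gain_ratio(rows, a, target))


# Hypothetical extracted market records (assumed for illustration only).
ROWS = [
    {"season": "summer", "supply": "high", "trend": "down"},
    {"season": "summer", "supply": "low", "trend": "up"},
    {"season": "winter", "supply": "high", "trend": "down"},
    {"season": "winter", "supply": "low", "trend": "up"},
]

if __name__ == "__main__":
    print(best_attribute(ROWS, ["season", "supply"], "trend"))
```

In this toy data `supply` fully determines `trend` while `season` carries no information, so the split falls on `supply`. Updating the tree with new data, as the abstract describes, would amount to rerunning this selection on the refreshed dataset.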
Keywords :
Internet; decision trees; information needs; information retrieval; C4.5/C5.0 algorithm; Web information extraction; World Wide Web; agricultural product market; decision tree technology; Agricultural engineering; Classification tree analysis; Consumer electronics; Data engineering; Data mining; Information technology; Testing; Web pages; classification; decision tree; information gain
Conference_Title :
Information Technology and Applications, 2009. IFITA '09. International Forum on
Conference_Location :
Chengdu
Print_ISBN :
978-0-7695-3600-2
DOI :
10.1109/IFITA.2009.394