DocumentCode
2549625
Title
An Efficient Classification System Based on Binary Search Trees for Data Streams Mining
Author
Wang, Tao ; Li, Zhoujun ; Yan, Yuejin ; Chen, Huowang ; Yu, Jinshan
Author_Institution
Nat. Univ. of Defense Technol., Changsha
fYear
2007
fDate
22-28 April 2007
Firstpage
15
Lastpage
15
Abstract
Decision tree construction is a well-studied problem in data mining. Recently, there has been much interest in mining data streams. Domingos and Hulten have presented a one-pass algorithm for decision tree constructions. Their system using Hoeffding inequality to achieve a probabilistic bound on the accuracy of the tree constructed. In this paper, we revisit this problem and propose a decision tree classifier system that uses binary search trees to handle numerical attributes. The proposed system is based on the most successful VFDT, and it achieves excellent performance. The most relevant property of our system is an average large reduction in processing time, while keeps the same tree size and accuracy.
Keywords
data analysis; data mining; decision trees; pattern classification; probability; Hoeffding inequality; binary search tree; data stream mining; decision tree classifier system; probability; Binary search trees; Classification tree analysis; Computer science; Data engineering; Data mining; Data processing; Databases; Decision trees; Input variables; Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, 2007. ICONS '07. Second International Conference on
Conference_Location
Martinique
Print_ISBN
0-7695-2807-4
Electronic_ISBN
0-7695-2807-4
Type
conf
DOI
10.1109/ICONS.2007.12
Filename
4196317
Link To Document