DocumentCode
2698008
Title
A Structural Sampling Technique for Better Decision Trees
Author
Sug, Hyontai
Author_Institution
Div. of Comput. & Inf. Eng., Dongseo Univ., Busan, South Korea
fYear
2009
fDate
1-3 April 2009
Firstpage
24
Lastpage
27
Abstract
Since data mining problems contain a large amount of data, sampling is a necessity for the success of the task. Decision trees have been developed for prediction, and finding decision trees with smaller error rates has been a major task for their success. This paper suggests a structural sampling technique that is based on a generated decision tree, where the tree is generated based on fast and dirty tree generation algorithm. Experiments with several sample sizes and representative decision tree algorithms showed that the method is more effective with respect to decision tree size and error rate than conventional random sampling method especially for small sample size.
Keywords
data mining; decision trees; sampling methods; data mining; data sampling; decision tree size; dirty tree generation algorithm; error rate; random sampling method; representative decision tree algorithm; structural sampling technique; Data mining; Decision trees; Deductive databases; Error analysis; Greedy algorithms; Induction generators; Intelligent structures; Machine learning; Sampling methods; Scalability; C4.5; CART; decision trees; sampling;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information and Database Systems, 2009. ACIIDS 2009. First Asian Conference on
Conference_Location
Dong Hoi
Print_ISBN
978-0-7695-3580-7
Type
conf
DOI
10.1109/ACIIDS.2009.24
Filename
5175961
Link To Document