DocumentCode :
3633649
Title :
Calculating the VC-dimension of decision trees
Author :
Ozlem Asian;Olcay Taner Yildiz;Ethem Alpaydin
Author_Institution :
Dept. of Comput. Eng., Bogazici Univ., Istanbul, Turkey
fYear :
2009
Firstpage :
193
Lastpage :
198
Abstract :
We propose an exhaustive search algorithm that calculates the VC-dimension of univariate decision trees with binary features. The VC-dimension of the univariate decision tree with binary features depends on (i) the VC-dimension values of the left and right subtrees, (ii) the number of inputs, and (iii) the number of nodes in the tree. From a training set of example trees whose VC-dimensions are calculated by exhaustive search, we fit a general regressor to estimate the VC-dimension of any binary tree. These VC-dimension estimates are then used to get VC-generalization bounds for complexity control using SRM in decision trees, i.e., pruning. Our simulation results shows that SRM-pruning using the estimated VC-dimensions finds trees that are as accurate as those pruned using cross-validation.
Keywords :
"Decision trees","Training data","Virtual colonoscopy","Testing","Predictive models","Algorithm design and analysis","Design optimization","Regression tree analysis","Binary trees","Machine learning"
Publisher :
ieee
Conference_Titel :
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Print_ISBN :
978-1-4244-5021-3
Type :
conf
DOI :
10.1109/ISCIS.2009.5291847
Filename :
5291847
Link To Document :
بازگشت