DocumentCode :
1216875
Title :
Model selection for CART regression trees
Author :
Gey, Servane ; Nedelec, Elodie
Author_Institution :
Lab. Paris, France
Volume :
51
Issue :
2
fYear :
2005
Firstpage :
658
Lastpage :
670
Abstract :
The performance of the classification and regression trees (CART) pruning algorithm and the final discrete selection by test sample as a functional estimation procedure are considered. The validation of the pruning procedure applied to Gaussian and bounded regression is of primary interest. On the one hand, the paper shows that the complexity penalty used in the pruning algorithm is valid in both cases and, on the other hand, that, conditionally to the construction of the maximal tree, the final selection does not alter dramatically the estimation accuracy of the regression function. In both cases, the risk bounds that are proved, obtained by using the penalized model selection, validate the CART algorithm which is used in many applications such as meteorology, biology, medicine, pollution monitoring, or image coding.
Keywords :
Gaussian processes; information theory; piecewise constant techniques; regression analysis; trees (mathematics); CART; Gaussian-bounded regression; classification-regression trees; functional estimation procedure; maximal tree; model selection; pruning algorithm; Biological system modeling; Biomedical imaging; Classification tree analysis; Computational biology; Image coding; Meteorology; Monitoring; Pollution; Regression tree analysis; Testing;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.2004.840903
Filename :
1386534
Link To Document :
بازگشت