Title :
Estimating expected error rates of random forest classifiers: A comparison of cross-validation and bootstrap
Author :
Ljumovic, Milica ; Klar, Michael
Author_Institution :
Fac. of Electr. Eng. Podgoric, Univ. of Montenegro, Podgorica, Montenegro
Abstract :
Statistical learning has recently seen an expansion of applications in different areas of science, finance and industry, as it plays a great role within the fields of statistics, data mining and artificial intelligence. Hence, it intersects with areas of engineering and other disciplines as well. It is used for both regression and classification problems. Solving these problems usually involves building/training a model/classifier and validating its performance for a given task. In this paper we compare two resampling methods for assessment of a random forest classifier: k-fold cross-validation and bootstrap. We use these methods to estimate the generalization error and to create learning curves. Both methods yield similar results on our data. The most important requirement for good generalization error estimates of either method is that the used data sample (i.e. the training dataset) represents the unknown true distribution of the data. This requirement cannot always be met in practice and results of resampling methods have to be interpreted with care if it is violated.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); pattern classification; random processes; sampling methods; bootstrap; classification problem; expected error rate estimation; generalization error estimation; k-fold cross-validation; learning curves; random forest classifiers; regression problem; resampling methods; statistical learning; Buildings; Embedded computing; Error analysis; Statistical learning; Training; Vegetation; bootstrap; classifier; cross-validation; learning curves; machine learning; resampling methods; statistical learning;
Conference_Titel :
Embedded Computing (MECO), 2015 4th Mediterranean Conference on
Conference_Location :
Budva
Print_ISBN :
978-1-4799-8999-7
DOI :
10.1109/MECO.2015.7181905