DocumentCode
2256252
Title
Comparison of subsampling techniques for random subspace ensembles
Author
Pathical, Santhosh ; Serpen, Gursel
Author_Institution
Electr. Eng. & Comput. Sci. Dept., Univ. of Toledo, Toledo, OH, USA
Volume
1
fYear
2010
fDate
11-14 July 2010
Firstpage
380
Lastpage
385
Abstract
This paper presents the comparison of three subsampling techniques for random subspace ensemble classifiers through an empirical study. A version of random subspace ensemble designed to address the challenges of high dimensional classification, entitled random subsample ensemble, within the voting combiner framework was evaluated for its performance for three different sampling methods which entailed random sampling without replacement, random sampling with replacement, and random partitioning. The random subsample ensemble was instantiated using three different base learners including C4.5, k-nearest neighbor, and naïve Bayes, and tested on five high-dimensional benchmark data sets in machine learning. Simulation results helped ascertain the optimal sampling technique for the ensemble, which turned out to be the sampling without replacement.
Keywords
learning (artificial intelligence); pattern classification; sampling methods; C4.5 learning; k-nearest neighbor learning; machine learning; naive Bayes learning; random partitioning method; random sampling with replacement method; random sampling without replacement method; random subspace ensemble classification; subsampling techniques; voting combiner framework; Accuracy; Classification algorithms; Machine learning; Machine learning algorithms; Measurement; Partitioning algorithms; Prediction algorithms; Curse of dimensionality; Ensemble classification; Random subsampling; Random subspace;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location
Qingdao
Print_ISBN
978-1-4244-6526-2
Type
conf
DOI
10.1109/ICMLC.2010.5581032
Filename
5581032
Link To Document