DocumentCode :
1000468
Title :
Ensembling local learners ThroughMultimodal perturbation
Author :
Zhou, Zhi-Hua ; Yu, Yang
Author_Institution :
Nat. Lab. for Novel Software Technol., Nanjing Univ., China
Volume :
35
Issue :
4
fYear :
2005
Firstpage :
725
Lastpage :
735
Abstract :
Ensemble learning algorithms train multiple component learners and then combine their predictions. In order to generate a strong ensemble, the component learners should be with high accuracy as well as high diversity. A popularly used scheme in generating accurate but diverse component learners is to perturb the training data with resampling methods, such as the bootstrap sampling used in bagging. However, such a scheme is not very effective on local learners such as nearest-neighbor classifiers because a slight change in training data can hardly result in local learners with big differences. In this paper, a new ensemble algorithm named Filtered Attribute Subspace based Bagging with Injected Randomness (FASBIR) is proposed for building ensembles of local learners, which utilizes multimodal perturbation to help generate accurate but diverse component learners. In detail, FASBIR employs the perturbation on the training data with bootstrap sampling, the perturbation on the input attributes with attribute filtering and attribute subspace selection, and the perturbation on the learning parameters with randomly configured distance metrics. A large empirical study shows that FASBIR is effective in building ensembles of nearest-neighbor classifiers, whose performance is better than that of many other ensemble algorithms.
Keywords :
data mining; learning (artificial intelligence); sampling methods; bootstrap sampling; data mining; ensemble learning algorithm; filtered attribute subspace; machine learning; multimodal perturbation; multiple component learner; nearest-neighbor classifier; resampling method; stable base learner; Bagging; Decision trees; Diversity reception; Educational programs; Filtering; Machine learning; Machine learning algorithms; Neural networks; Sampling methods; Training data; Data mining; ensemble learning; local learner; machine learning; multimodal perturbation; nearest-neighbor classifier; stable base learner; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Models, Statistical; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2005.845396
Filename :
1468246
Link To Document :
بازگشت