Title :
Combining Distributed Classifies by Stacking
Author :
Wei Yanyan ; Li Taoshen ; Ge Zhihui
Author_Institution :
Sch. of Math. & Comput. Sci., Guangxi Univ. for Nat., Nanning, China
Abstract :
Many current mining tasks analyze data in environments with distributed computing nodes. Classification in such scenario needs to perform local mining task in each data site and then integrate local classifiers to a global model of the data. However, integration strategy can influence the performance and complexity of the final model. In this paper, based on the formalization of combining multiple classifiers by stacking in Distributed Data Mining, a new strategy to from meta-level training set is proposed, which can describe the vote made by each base-level classifiers. The experiment results show that our method achieve better performance for those datasets with highly skewed class distribution.
Keywords :
data mining; distributed processing; pattern classification; base-level classifiers; distributed classifiers; distributed computing; distributed data mining; metalevel training set; Accuracy; Costs; Data mining; Distributed computing; Distributed decision making; Genetics; Mathematics; Probability distribution; Stacking; Voting; DDM; classification; combining classifiers; stacking;
Conference_Titel :
Genetic and Evolutionary Computing, 2009. WGEC '09. 3rd International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-0-7695-3899-0
DOI :
10.1109/WGEC.2009.146