• DocumentCode
    3346393
  • Title

    Combining Distributed Classifies by Stacking

  • Author

    Wei Yanyan ; Li Taoshen ; Ge Zhihui

  • Author_Institution
    Sch. of Math. & Comput. Sci., Guangxi Univ. for Nat., Nanning, China
  • fYear
    2009
  • fDate
    14-17 Oct. 2009
  • Firstpage
    418
  • Lastpage
    421
  • Abstract
    Many current mining tasks analyze data in environments with distributed computing nodes. Classification in such scenario needs to perform local mining task in each data site and then integrate local classifiers to a global model of the data. However, integration strategy can influence the performance and complexity of the final model. In this paper, based on the formalization of combining multiple classifiers by stacking in Distributed Data Mining, a new strategy to from meta-level training set is proposed, which can describe the vote made by each base-level classifiers. The experiment results show that our method achieve better performance for those datasets with highly skewed class distribution.
  • Keywords
    data mining; distributed processing; pattern classification; base-level classifiers; distributed classifiers; distributed computing; distributed data mining; metalevel training set; Accuracy; Costs; Data mining; Distributed computing; Distributed decision making; Genetics; Mathematics; Probability distribution; Stacking; Voting; DDM; classification; combining classifiers; stacking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genetic and Evolutionary Computing, 2009. WGEC '09. 3rd International Conference on
  • Conference_Location
    Guilin
  • Print_ISBN
    978-0-7695-3899-0
  • Type

    conf

  • DOI
    10.1109/WGEC.2009.146
  • Filename
    5402861