• DocumentCode
    2474741
  • Title

    Imbalanced data classification algorithm based on boosting and cascade model

  • Author

    Zhang, Xiaolong ; Cheng, Chao

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Wuhan Univ. of Sci. & Technol., Wuhan, China
  • fYear
    2012
  • fDate
    14-17 Oct. 2012
  • Firstpage
    2861
  • Lastpage
    2866
  • Abstract
    Traditional classification algorithms are difficult in dealing with imbalance data. This paper proposes a classification algorithm called CascadeBoost, which combines with the advantages of boosting algorithm and cascade model that can learn imbalance data. Cascade model allows the pre-training data to be balanced by gradually reducing the number of the major class; and then the most rich information samples based on the weight distribution can be gradually selected using boosting algorithm. The experimental results show that the proposed method obtains better performance compared to other methods.
  • Keywords
    learning (artificial intelligence); pattern classification; CascadeBoost; boosting algorithm; cascade model; imbalance data classification algorithm; information samples; pretraining data; weight distribution; Algorithm design and analysis; Boosting; Classification algorithms; Data models; Prediction algorithms; Support vector machines; Training; AUC; Boosting Algorithm; Cascade Model; Imbalance Data; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-1713-9
  • Electronic_ISBN
    978-1-4673-1712-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2012.6378183
  • Filename
    6378183