• DocumentCode
    2741848
  • Title

    An Improved AdaBoost Algorithm for Unbalanced Classification Data

  • Author

    Song, Jie ; Lu, Xiaoling ; Wu, Xizhi

  • Author_Institution
    Sch. of Stat., Renmin Univ. of China, Beijing, China
  • Volume
    1
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    109
  • Lastpage
    113
  • Abstract
    AdaBoost algorithm is proved to be a very efficient classification method for the balanced dataset with all classes having similar proportions. However, in real application, it is quite common to have unbalanced dataset with a certain class of interest having very small size. It will be problematic since the algorithm might predict all the cases into majority classes without loss of overall accuracy. This paper proposes an improved AdaBoost algorithm called BABoost (Balanced AdaBoost), which gives higher weights to the misclassified examples from the minority class. Empirical results show that the new method decreases the prediction error of minority class significantly with increasing the prediction error of majority class a little bit. It can also produce higher values of margin which indicates a better classification method.
  • Keywords
    Ada; data handling; pattern classification; BABoost; balanced AdaBoost; unbalanced classification data; Bagging; Classification algorithms; Fuzzy systems; Machine learning; Machine learning algorithms; Prediction algorithms; Probability; Sampling methods; Statistics; Voting; AdaBoost; classification; multiclass; unbalanced data; within group error;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type

    conf

  • DOI
    10.1109/FSKD.2009.608
  • Filename
    5358645