• DocumentCode
    727920
  • Title

    Solving the class imbalance problems using RUSMultiBoost ensemble

  • Author

    Mustafa, Ghulam ; Zhendong Niu ; Yousif, Abdallah ; Tarus, John

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
  • fYear
    2015
  • fDate
    17-20 June 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A data set is considered imbalanced when its class representation is substantially different. Examples of rare class are infrequent and cost more than common class examples in binary class imbalance data sets. Common learners usually incline toward common class and rare class examples are missed due to class imbalance. Ensemble learning approach combined with data resampling gains popularity to solve class imbalance problem, recently. RUSBoost and SMOTEBoost are two such methods that combine data resampling techniques with boosting procedure. We propose RUSMultiBoost, a hybrid method that is constituent of MultiBoost ensemble and random undersampling (RUS) to solve the class imbalance problem. Our new method is as simple as RUSBoost but more efficient and effective. We test our method on twelve data sets for class imbalance problem and compare the performance with simple and advanced hybrid ensemble methods. Experimental results show that our hybrid ensemble method performs significantly better than other methods on benchmark data sets using G-mean, Sensitivity and F1-measure. In addition, our method is also suitable for parallel execution as contrast to other boosting methods.
  • Keywords
    data handling; learning (artificial intelligence); sampling methods; F1-measure; G-mean; RUSMultiBoost ensemble; SMOTEBoost; binary class imbalance data sets; class imbalance problems; class representation; data resampling techniques; ensemble learning approach; hybrid ensemble methods; random undersampling; sensitivity; Bagging; Boosting; Committees; Diversity reception; Sensitivity; Training; Class imbalance learning; Multiboosting; ensemble learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Systems and Technologies (CISTI), 2015 10th Iberian Conference on
  • Conference_Location
    Aveiro
  • Type

    conf

  • DOI
    10.1109/CISTI.2015.7170597
  • Filename
    7170597