Title :
Imbalanced data classification using random subspace method and SMOTE
Author :
Hsiao-Yun Huang ; Yi-Jhen Lin ; Youg-Siang Chen ; Hung-Yi Lu
Author_Institution :
Dept. of Stat. & Inf. Sci., Fu-Jen Catholic Univ., Taipei, Taiwan
Abstract :
Class imbalance problem has attracted many attentions in recent years. When the available training sample size of each class is imbalanced, the directly established classification model will tend to allocate the testing sample into the majority class. A proper resampling method together with a power classifier is generally employed for dealing with this problem. Many multi-classifier ensembles have been shown to outperform single classifier in many experiments. Bagging and boosting are two most popular multi-classifier frameworks and have been applied to deal with the class imbalance problem. By observing that the sample information of the minority class is very limited and the small sample size (SSS) problem might decrease the performance of the classifiers, another powerful multi-classifier method called random subspace method (RSM) is introduced to deal with the class imbalance problem in this study. To evaluate the performance of different classifiers, a well-known resampling method called SMOTE is employed. The experiment results showed RSM has the best performance in most of the considered situations.
Keywords :
learning (artificial intelligence); pattern classification; sampling methods; SMOTE method; SSS problem; bagging framework; boosting framework; class imbalance problem; imbalanced data classification; multiclassifier ensemble; random subspace method; resampling method; small sample size problem; Bagging; Boosting; Class imbalance; Multi-classifier ensemble; RSM; SMOTE;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
DOI :
10.1109/SCIS-ISIS.2012.6505155