DocumentCode :
3580810
Title :
Hybrid sampling for multiclass imbalanced problem: Case study of students´ performance prediction
Author :
Prachuabsupakij, Wanthanee ; Soonthornphisaj, Nuanwan
Author_Institution :
Dept. of Inf. Technol., King Mongkut´s Univ. of Technol., Bangkok, Thailand
fYear :
2014
Firstpage :
321
Lastpage :
326
Abstract :
The aim of this paper is to propose a method namely CLUSS - CLUstering and SMOTE Sampling that can improve the prediction performance on multiclass imbalanced problem with students´ performance data. Firstly, the clustering approach is used to create a new subset from all majority classes. The new subsets consists of the groups of majority classes instances which have different characteristics. Secondly, oversampling technique is applied to generate the new synthetic minority class instances. Then, CLUSS constructs the new training set by combining all minority class instances and the majority class instances in each subset. Finally, for each training set decision tree is used as a classifier to predict the classes via majority vote. The experimental results show that CLUSS achieved high performance on both majority and minority classes.
Keywords :
data mining; educational administrative data processing; pattern clustering; CLUSS method; class prediction; clustering-and-SMOTE sampling method; hybrid sampling; majority class instances; majority vote; multiclass imbalanced problem; oversampling technique; prediction performance improvement; student performance data prediction; synthetic minority class instances; training set decision tree; Classification algorithms; Clustering algorithms; Data mining; Decision trees; Information technology; Prediction algorithms; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computer Science and Information Systems (ICACSIS), 2014 International Conference on
Type :
conf
DOI :
10.1109/ICACSIS.2014.7065824
Filename :
7065824
Link To Document :
بازگشت