DocumentCode :
1563942
Title :
An Over-sampling Expert System for Learing from Imbalanced Data Sets
Author :
He, Guoxun ; Han, Hui ; Wang, Wenyuan
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing
Volume :
1
fYear :
2005
Firstpage :
537
Lastpage :
541
Abstract :
Learning from imbalanced datasets has become an important branch in the machine learning field. A relatively simple and effective method to solve the imbalance problem is re-sampling, which contains under-sampling and over-sampling. A representative over-sampling approach is SMOTE (synthetic minority over-sampling technique). However, it is not easy to decide the best distribution of minority and majority samples included in a given training set when SMOTE is applied to the imbalance situation. This paper presents an over-sampling expert system to ensemble classifiers trained on the data sets over-sampled at different rates. The proposed combination method, C-SMOTE, applied to several highly and moderately imbalanced data sets can automatically and intelligently obtain an optimal SMOTE rate, and shows improvement in prediction accuracy and overall F-measure on the minority class
Keywords :
expert systems; learning (artificial intelligence); sampling methods; imbalanced data sets; machine learning; over-sampling expert system; synthetic minority over-sampling technique; Accuracy; Algorithm design and analysis; Automation; Embryo; Expert systems; Fault detection; Helium; Learning systems; Machine learning; Machine learning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
Type :
conf
DOI :
10.1109/ICNNB.2005.1614671
Filename :
1614671
Link To Document :
بازگشت