DocumentCode
1563942
Title
An Over-sampling Expert System for Learing from Imbalanced Data Sets
Author
He, Guoxun ; Han, Hui ; Wang, Wenyuan
Author_Institution
Dept. of Autom., Tsinghua Univ., Beijing
Volume
1
fYear
2005
Firstpage
537
Lastpage
541
Abstract
Learning from imbalanced datasets has become an important branch in the machine learning field. A relatively simple and effective method to solve the imbalance problem is re-sampling, which contains under-sampling and over-sampling. A representative over-sampling approach is SMOTE (synthetic minority over-sampling technique). However, it is not easy to decide the best distribution of minority and majority samples included in a given training set when SMOTE is applied to the imbalance situation. This paper presents an over-sampling expert system to ensemble classifiers trained on the data sets over-sampled at different rates. The proposed combination method, C-SMOTE, applied to several highly and moderately imbalanced data sets can automatically and intelligently obtain an optimal SMOTE rate, and shows improvement in prediction accuracy and overall F-measure on the minority class
Keywords
expert systems; learning (artificial intelligence); sampling methods; imbalanced data sets; machine learning; over-sampling expert system; synthetic minority over-sampling technique; Accuracy; Algorithm design and analysis; Automation; Embryo; Expert systems; Fault detection; Helium; Learning systems; Machine learning; Machine learning algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location
Beijing
Print_ISBN
0-7803-9422-4
Type
conf
DOI
10.1109/ICNNB.2005.1614671
Filename
1614671
Link To Document