DocumentCode :
3543367
Title :
Learning from imbalanced data using methods of sample selection
Author :
Chairi, I. ; Alaoui, Souad ; Lyhyaoui, Abdelouahid
Author_Institution :
LTILab, Abdelmalek Essaadi Univ., Tanger, Morocco
fYear :
2012
fDate :
10-12 May 2012
Firstpage :
254
Lastpage :
257
Abstract :
The majority of Machine Learning (ML) habitually assume that the training sets used for learning are balanced. However, in real world application this hypothesis is not always true. The problem of between-class imbalance is a challenge that has attracted growing attention from both academia and industry because of his critical influence on the performance of machine learning. Many solutions are proposed to resolve this problem: Generally, the common practice for dealing with imbalanced data sets is to rebalance them artificially by using sampling methods. On the other hand, researches show that Sample Selection (SS) methods help to improve the accuracy during the learning process. The main idea of our work is to apply a technique of Sample Selection on the majority class to achieve an undersampling for the imbalanced data. This procedure consent to deal with the imbalance problem and to improve the performance of learning.
Keywords :
data handling; learning (artificial intelligence); sampling methods; ML; SS methods; between-class imbalance; imbalanced data learning; learning process; machine learning; majority class; sample selection method; sampling methods; training sets; Accuracy; Artificial neural networks; Classification algorithms; IEEE transactions; Measurement uncertainty; Presses; Imbalanced data; Multi-Layer Perceptron; sample selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Computing and Systems (ICMCS), 2012 International Conference on
Conference_Location :
Tangier
Print_ISBN :
978-1-4673-1518-0
Type :
conf
DOI :
10.1109/ICMCS.2012.6320291
Filename :
6320291
Link To Document :
بازگشت