Title :
Learning from imbalanced data using methods of sample selection
Author :
Chairi, I. ; Alaoui, Souad ; Lyhyaoui, Abdelouahid
Author_Institution :
LTILab, Abdelmalek Essaadi Univ., Tanger, Morocco
Abstract :
The majority of Machine Learning (ML) habitually assume that the training sets used for learning are balanced. However, in real world application this hypothesis is not always true. The problem of between-class imbalance is a challenge that has attracted growing attention from both academia and industry because of his critical influence on the performance of machine learning. Many solutions are proposed to resolve this problem: Generally, the common practice for dealing with imbalanced data sets is to rebalance them artificially by using sampling methods. On the other hand, researches show that Sample Selection (SS) methods help to improve the accuracy during the learning process. The main idea of our work is to apply a technique of Sample Selection on the majority class to achieve an undersampling for the imbalanced data. This procedure consent to deal with the imbalance problem and to improve the performance of learning.
Keywords :
data handling; learning (artificial intelligence); sampling methods; ML; SS methods; between-class imbalance; imbalanced data learning; learning process; machine learning; majority class; sample selection method; sampling methods; training sets; Accuracy; Artificial neural networks; Classification algorithms; IEEE transactions; Measurement uncertainty; Presses; Imbalanced data; Multi-Layer Perceptron; sample selection;
Conference_Titel :
Multimedia Computing and Systems (ICMCS), 2012 International Conference on
Conference_Location :
Tangier
Print_ISBN :
978-1-4673-1518-0
DOI :
10.1109/ICMCS.2012.6320291