Title :
Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data
Author :
Castro, C.L. ; Braga, Antonio Padua
Author_Institution :
Grad. Program in Electr. Eng., Fed. Univ. of Minas Gerais, Belo Horizonte, Brazil
Abstract :
Traditional learning algorithms applied to complex and highly imbalanced training sets may not give satisfactory results when distinguishing between examples of the classes. The tendency is to yield classification models that are biased towards the overrepresented (majority) class. This paper investigates this class imbalance problem in the context of multilayer perceptron (MLP) neural networks. The consequences of the equal cost (loss) assumption on imbalanced data are formally discussed from a statistical learning theory point of view. A new cost-sensitive algorithm (CSMLP) is presented to improve the discrimination ability of (two-class) MLPs. The CSMLP formulation is based on a joint objective function that uses a single cost parameter to distinguish the importance of class errors. The learning rule extends the Levenberg-Marquadt´s rule, ensuring the computational efficiency of the algorithm. In addition, it is theoretically demonstrated that the incorporation of prior information via the cost parameter may lead to balanced decision boundaries in the feature space. Based on the statistical analysis of results on real data, our approach shows a significant improvement of the area under the receiver operating characteristic curve and G-mean measures of regular MLPs.
Keywords :
learning (artificial intelligence); multilayer perceptrons; pattern classification; sensitivity analysis; statistical analysis; CSMLP formulation; G-mean measures; Levenberg-Marquadt rule; MLP neural networks; balanced decision boundaries; class imbalance problem; classification models; cost-sensitive algorithm; discrimination ability improvement; imbalanced data; joint objective function; learning rule; multilayer perceptron performance improvement; receiver operating characteristic curve; statistical analysis; statistical learning theory; Algorithm design and analysis; Cost function; Jacobian matrices; Joints; Multilayer perceptrons; Training; Vectors; Class imbalance problem; Levenberg–Marquadt; cost-sensitive approach; multilayer perceptron; objective function;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2013.2246188