DocumentCode :
2478676
Title :
A supervised learning approach for imbalanced data sets
Author :
Nguyen, Giang H. ; Bouzerdoum, Abdesselam ; Phung, Son L.
Author_Institution :
Sch. of Electr., Univ. of Wollongong, Wollongong, NSW
fYear :
2008
fDate :
8-11 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents a new learning approach for pattern classification applications involving imbalanced data sets. In this approach, a clustering technique is employed to resample the original training set into a smaller set of representative training exemplars, represented by weighted cluster centers and their target outputs. Based on the proposed learning approach, four training algorithms are derived for feed-forward neural networks. These algorithms are implemented and tested on three benchmark data sets. Experimental results show that with the proposed learning approach, it is possible to design networks to tackle the class imbalance problem, without compromising the overall classification performance.
Keywords :
feedforward neural nets; learning (artificial intelligence); pattern classification; pattern clustering; clustering technique; feedforward neural networks; imbalanced data sets; pattern classification applications; supervised learning; Application software; Clustering algorithms; Cost function; Data engineering; Electronic mail; Neural networks; Pattern classification; Supervised learning; Telecommunication computing; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
ISSN :
1051-4651
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2008.4761278
Filename :
4761278
Link To Document :
بازگشت