DocumentCode :
2474741
Title :
Imbalanced data classification algorithm based on boosting and cascade model
Author :
Zhang, Xiaolong ; Cheng, Chao
Author_Institution :
Sch. of Comput. Sci. & Technol., Wuhan Univ. of Sci. & Technol., Wuhan, China
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
2861
Lastpage :
2866
Abstract :
Traditional classification algorithms are difficult in dealing with imbalance data. This paper proposes a classification algorithm called CascadeBoost, which combines with the advantages of boosting algorithm and cascade model that can learn imbalance data. Cascade model allows the pre-training data to be balanced by gradually reducing the number of the major class; and then the most rich information samples based on the weight distribution can be gradually selected using boosting algorithm. The experimental results show that the proposed method obtains better performance compared to other methods.
Keywords :
learning (artificial intelligence); pattern classification; CascadeBoost; boosting algorithm; cascade model; imbalance data classification algorithm; information samples; pretraining data; weight distribution; Algorithm design and analysis; Boosting; Classification algorithms; Data models; Prediction algorithms; Support vector machines; Training; AUC; Boosting Algorithm; Cascade Model; Imbalance Data; SVM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
Type :
conf
DOI :
10.1109/ICSMC.2012.6378183
Filename :
6378183
Link To Document :
بازگشت