DocumentCode
2474741
Title
Imbalanced data classification algorithm based on boosting and cascade model
Author
Zhang, Xiaolong ; Cheng, Chao
Author_Institution
Sch. of Comput. Sci. & Technol., Wuhan Univ. of Sci. & Technol., Wuhan, China
fYear
2012
fDate
14-17 Oct. 2012
Firstpage
2861
Lastpage
2866
Abstract
Traditional classification algorithms are difficult in dealing with imbalance data. This paper proposes a classification algorithm called CascadeBoost, which combines with the advantages of boosting algorithm and cascade model that can learn imbalance data. Cascade model allows the pre-training data to be balanced by gradually reducing the number of the major class; and then the most rich information samples based on the weight distribution can be gradually selected using boosting algorithm. The experimental results show that the proposed method obtains better performance compared to other methods.
Keywords
learning (artificial intelligence); pattern classification; CascadeBoost; boosting algorithm; cascade model; imbalance data classification algorithm; information samples; pretraining data; weight distribution; Algorithm design and analysis; Boosting; Classification algorithms; Data models; Prediction algorithms; Support vector machines; Training; AUC; Boosting Algorithm; Cascade Model; Imbalance Data; SVM;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4673-1713-9
Electronic_ISBN
978-1-4673-1712-2
Type
conf
DOI
10.1109/ICSMC.2012.6378183
Filename
6378183
Link To Document