Title : 
Using Boosting and Clustering to Prune Bagging and Detect Noisy Data
         
        
            Author : 
Xie, Yuan Cheng ; Yang, Jing Yu
         
        
            Author_Institution : 
Sch. of Comput. Sci. & Technol., Nanjing Univ. of Sci. & Technol., Nanjing, China
         
        
        
        
        
        
            Abstract : 
AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.
         
        
            Keywords : 
learning (artificial intelligence); pattern clustering; AdaBoost; data clustering; ensemble learning; noisy data detection; parallel computing; prune bagging; Bagging; Boosting; Clustering algorithms; Computer science; Educational institutions; Information science; Neural networks; Parallel processing; Reactive power;
         
        
        
        
            Conference_Titel : 
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
         
        
            Conference_Location : 
Nanjing
         
        
            Print_ISBN : 
978-1-4244-4199-0
         
        
        
            DOI : 
10.1109/CCPR.2009.5344126