Title : 
A classification algorithm for noisy data streams
         
        
            Author : 
Yan Li ; Zhang, Yuhong ; Hu Xuegang ; Li Peipei
         
        
            Author_Institution : 
Sch. of Comput. & Inf., Hefei Univ. of Technol., Hefei, China
         
        
        
        
        
        
        
            Abstract : 
Classification on noisy data streams has recently become one of the most important topics in streaming data mining. In this paper, a Classification algorithm for mining Data Streams based on Mixture Models of C4.5 and NB is proposed called CDSMM. In this algorithm, C4.5 is used as the base classifiers, the hypothesis testing method is introduced for the detection of concept drifts, and a Naïve Bayes classifier is adopted to filter noise. Extensive experiments demonstrate that CDSMM has substantial advantages over similar existing algorithms in the predictive accuracy on noisy data streams with concept drifts.
         
        
            Keywords : 
Bayes methods; data mining; pattern classification; Bayes classifier; C4.5; CDSMM; hypothesis testing method; noisy data stream classification; streaming data mining; Accuracy; Classification algorithms; Data mining; Niobium; Noise; Noise measurement; Prediction algorithms; Classification; Concept Drift; Data Stream; Noise;
         
        
        
        
            Conference_Titel : 
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
         
        
            Conference_Location : 
Yantai, Shandong
         
        
            Print_ISBN : 
978-1-4244-5931-5
         
        
        
            DOI : 
10.1109/FSKD.2010.5569533