Title : 
A Novel Noise Filtering Algorithm for Imbalanced Data
         
        
            Author : 
Van Hulse, Jason ; Khoshgoftaar, Taghi M. ; Napolitano, Amri
         
        
            Author_Institution : 
Florida Atlantic Univ., Boca Raton, FL, USA
         
        
        
        
        
        
            Abstract : 
Noise filtering is a commonly-used methodology to improve the performance of learners built using low-quality data. A common type of noise filtering is a data preprocessing technique called classification filtering. In classification filtering, a classifier is built and evaluated on the training dataset (typically using cross-validation) and any misclassified instances are considered noisy. The strategies employed with classification filters are not ideal, particularly when learning from class-imbalanced data. To address this deficiency, we propose an alternative method for classification filtering called the threshold-adjusted classification filter. This methodology is compared with the standard classification filter, and the results clearly demonstrate the efficacy of our technique.
         
        
            Keywords : 
filtering theory; noise; pattern classification; cross-validation; data preprocessing technique; imbalanced data; noise filtering algorithm; threshold-adjusted classification filter; training dataset; Neodymium; Niobium; Noise; Noise level; Noise measurement; Training; Training data;
         
        
        
        
            Conference_Titel : 
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
         
        
            Conference_Location : 
Washington, DC
         
        
            Print_ISBN : 
978-1-4244-9211-4
         
        
        
            DOI : 
10.1109/ICMLA.2010.9