DocumentCode
2453062
Title
A Novel Noise Filtering Algorithm for Imbalanced Data
Author
Van Hulse, Jason ; Khoshgoftaar, Taghi M. ; Napolitano, Amri
Author_Institution
Florida Atlantic Univ., Boca Raton, FL, USA
fYear
2010
fDate
12-14 Dec. 2010
Firstpage
9
Lastpage
14
Abstract
Noise filtering is a commonly-used methodology to improve the performance of learners built using low-quality data. A common type of noise filtering is a data preprocessing technique called classification filtering. In classification filtering, a classifier is built and evaluated on the training dataset (typically using cross-validation) and any misclassified instances are considered noisy. The strategies employed with classification filters are not ideal, particularly when learning from class-imbalanced data. To address this deficiency, we propose an alternative method for classification filtering called the threshold-adjusted classification filter. This methodology is compared with the standard classification filter, and the results clearly demonstrate the efficacy of our technique.
Keywords
filtering theory; noise; pattern classification; cross-validation; data preprocessing technique; imbalanced data; noise filtering algorithm; threshold-adjusted classification filter; training dataset; Neodymium; Niobium; Noise; Noise level; Noise measurement; Training; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location
Washington, DC
Print_ISBN
978-1-4244-9211-4
Type
conf
DOI
10.1109/ICMLA.2010.9
Filename
5708806
Link To Document