• DocumentCode
    2453062
  • Title

    A Novel Noise Filtering Algorithm for Imbalanced Data

  • Author

    Van Hulse, Jason ; Khoshgoftaar, Taghi M. ; Napolitano, Amri

  • Author_Institution
    Florida Atlantic Univ., Boca Raton, FL, USA
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    9
  • Lastpage
    14
  • Abstract
    Noise filtering is a commonly-used methodology to improve the performance of learners built using low-quality data. A common type of noise filtering is a data preprocessing technique called classification filtering. In classification filtering, a classifier is built and evaluated on the training dataset (typically using cross-validation) and any misclassified instances are considered noisy. The strategies employed with classification filters are not ideal, particularly when learning from class-imbalanced data. To address this deficiency, we propose an alternative method for classification filtering called the threshold-adjusted classification filter. This methodology is compared with the standard classification filter, and the results clearly demonstrate the efficacy of our technique.
  • Keywords
    filtering theory; noise; pattern classification; cross-validation; data preprocessing technique; imbalanced data; noise filtering algorithm; threshold-adjusted classification filter; training dataset; Neodymium; Niobium; Noise; Noise level; Noise measurement; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.9
  • Filename
    5708806