• DocumentCode
    117271
  • Title

    Deleting or keeping outliers for classifier training?

  • Author

    Tallon-Ballesteros, Antonio J. ; Riquelme, Jose C.

  • Author_Institution
    Dept. of Languages & Comput. Syst., Univ. of Seville, Seville, Spain
  • fYear
    2014
  • fDate
    July 30 2014-Aug. 1 2014
  • Firstpage
    281
  • Lastpage
    286
  • Abstract
    This paper introduces two statistical outlier detection approaches by classes. Experiments on binary and multi-class classification problems reveal that the partial removal of outliers improves significantly one or two performance measures for C4.5 and 1-nearest neighbour classifiers. Also, a taxonomy of problems according to the amount of outliers is proposed.
  • Keywords
    data mining; statistical analysis; binary classification problems; classifier training; data mining; multiclass classification problems; nearest neighbour classifiers; statistical outlier detection; Detectors; Training; attribute noise; classification; inter-quartile range; outlier detection; partial removal; statistical outlier detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Nature and Biologically Inspired Computing (NaBIC), 2014 Sixth World Congress on
  • Conference_Location
    Porto
  • Print_ISBN
    978-1-4799-5936-5
  • Type

    conf

  • DOI
    10.1109/NaBIC.2014.6921892
  • Filename
    6921892