• DocumentCode
    2064637
  • Title

    A validity index for outlier detection

  • Author

    Yousri, Noha A.

  • Author_Institution
    Comput. & Syst. Eng., Alexandria Univ., Alexandria, Egypt
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 1 2010
  • Firstpage
    325
  • Lastpage
    329
  • Abstract
    Defining a boundary between inliers and outliers is a major challenge in unsupervised outlier detection. In the absence of labeled data, the true outliers set cannot be evaluated. This lays the burden on both the choice of an efficient outlier detection criterion, and parameter selection. While numerous unsupervised outlier detection criteria, with different parameters, have been proposed, an unsupervised evaluation of outliers is still missing. This work introduces a theoretical basis, and proposes a validity index, to evaluate the quality of outliers. This is not a trivial problem when nothing is known about the structure and density of the data. The proposed index considers the outlierness quality, the deviation between characteristics of outliers and inliers, and the data distortion. Low and high dimensional data sets are used to evaluate the proposed index.
  • Keywords
    data analysis; pattern classification; pattern clustering; unsupervised learning; data distortion; outlierness quality; parameter selection; true outliers set; unsupervised evaluation; unsupervised outlier detection criteria; validity index; Accuracy; Conferences; Density measurement; Distortion measurement; Indexes; Intelligent systems; Pattern recognition; outlier analysis; validity index;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-8134-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2010.5687245
  • Filename
    5687245