DocumentCode
2064637
Title
A validity index for outlier detection
Author
Yousri, Noha A.
Author_Institution
Comput. & Syst. Eng., Alexandria Univ., Alexandria, Egypt
fYear
2010
fDate
Nov. 29 2010-Dec. 1 2010
Firstpage
325
Lastpage
329
Abstract
Defining a boundary between inliers and outliers is a major challenge in unsupervised outlier detection. In the absence of labeled data, the true outliers set cannot be evaluated. This lays the burden on both the choice of an efficient outlier detection criterion, and parameter selection. While numerous unsupervised outlier detection criteria, with different parameters, have been proposed, an unsupervised evaluation of outliers is still missing. This work introduces a theoretical basis, and proposes a validity index, to evaluate the quality of outliers. This is not a trivial problem when nothing is known about the structure and density of the data. The proposed index considers the outlierness quality, the deviation between characteristics of outliers and inliers, and the data distortion. Low and high dimensional data sets are used to evaluate the proposed index.
Keywords
data analysis; pattern classification; pattern clustering; unsupervised learning; data distortion; outlierness quality; parameter selection; true outliers set; unsupervised evaluation; unsupervised outlier detection criteria; validity index; Accuracy; Conferences; Density measurement; Distortion measurement; Indexes; Intelligent systems; Pattern recognition; outlier analysis; validity index;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location
Cairo
Print_ISBN
978-1-4244-8134-7
Type
conf
DOI
10.1109/ISDA.2010.5687245
Filename
5687245
Link To Document