DocumentCode :
2866647
Title :
Example-based robust outlier detection in high dimensional datasets
Author :
Zhu, Cui ; Kitagawa, Hiroyuki ; Faloutsos, Christos
Author_Institution :
Graduate Sch. of Syst. & Inf. Eng., Tsukuba Univ., Ibaraki, Japan
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
Detecting outliers is an important problem. Most of its applications typically possess high dimensional datasets. In high dimensional space, the data becomes sparse which implies that every object can be regarded as an outlier from the point of view of similarity. Furthermore, a fundamental issue is that the notion of which objects are outliers typically varies between users, problem domains or, even, datasets. In this paper, we present a novel robust solution which detects high dimensional outliers based on user examples and tolerates incorrect inputs. It studies the behavior of projections of such a few examples, to discover further objects that are outstanding in the projection where many examples are outlying. Our experiments on both real and synthetic datasets demonstrate the ability of the proposed method to detect outliers corresponding to the user examples.
Keywords :
data analysis; example-based robust outlier detection; high dimensional data sets; high dimensional outlier; Application software; Clustering algorithms; Computer science; Data mining; Design methodology; Monitoring; Object detection; Robustness; Systems engineering and theory; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.59
Filename :
1565793
Link To Document :
بازگشت