DocumentCode :
140755
Title :
Discriminative features for identifying and interpreting outliers
Author :
Xuan Hong Dang ; Assent, Ira ; Ng, Raymond T. ; Zimek, Arthur ; Schubert, Eugen
Author_Institution :
Dept. of Comput. Sci., Aarhus Univ., Aarhus, Denmark
fYear :
2014
fDate :
March 31 2014-April 4 2014
Firstpage :
88
Lastpage :
99
Abstract :
We consider the problem of outlier detection and interpretation. While most existing studies focus on the first problem, we simultaneously address the equally important challenge of outlier interpretation. We propose an algorithm that uncovers outliers in subspaces of reduced dimensionality in which they are well discriminated from regular objects while at the same time retaining the natural local structure of the original data to ensure the quality of outlier explanation. Our algorithm takes a mathematically appealing approach from the spectral graph embedding theory and we show that it achieves the globally optimal solution for the objective of subspace learning. By using a number of real-world datasets, we demonstrate its appealing performance not only w.r.t. the outlier detection rate but also w.r.t. the discriminative human-interpretable features. This is the first approach to exploit discriminative features for both outlier detection and interpretation, leading to better understanding of how and why the hidden outliers are exceptional.
Keywords :
data analysis; data mining; statistical distributions; discriminative human-interpretable features; outlier detection; outlier explanation quality; outlier interpretation; spectral graph embedding theory; Data mining; Eigenvalues and eigenfunctions; Face; Feature extraction; Linear programming; Sparse matrices; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816642
Filename :
6816642
Link To Document :
بازگشت