Title :
Outlier detection via localized p-value estimation
Author :
Zhao, Manqi ; Saligrama, Venkatesh
Author_Institution :
Dept. of Electr. & Comput. Eng., Boston Univ., Boston, MA, USA
fDate :
Sept. 30 2009-Oct. 2 2009
Abstract :
We propose a novel non-parametric adaptive outlier detection algorithm, called LPE, for high dimensional data based on score functions derived from nearest neighbor graphs on n-point nominal data. Outliers are predicted whenever the score of a test sample falls below ¿, which is supposed to be the desired false alarm level. The resulting outlier detector is shown to be asymptotically optimal in that it is uniformly most powerful for the specified false alarm level, ¿, for the case when the density associated with the outliers is a mixture of the nominal and a known density. Our algorithm is computationally efficient, being linear in dimension and quadratic in data size. The whole empirical receiving operating characteristics (ROC) curve can be derived with almost no additional cost based on the estimated score function. It does not require choosing complicated tuning parameters or function approximation classes and it can adapt to local structure such as local change in dimensionality by incorporating the technique of manifold learning. We demonstrate the algorithm on both artificial and real data sets in high dimensional feature spaces.
Keywords :
data analysis; estimation theory; function approximation; graph theory; learning (artificial intelligence); nonparametric statistics; estimated score function; false alarm level; function approximation classes; high dimensional data; localized p-value estimation; manifold learning; n-point nominal data; nearest neighbor graphs; nonparametric adaptive outlier detection algorithm; receiving operating characteristics curve; score functions; tuning parameters; Change detection algorithms; Cost function; Detection algorithms; Detectors; Function approximation; Level set; Nearest neighbor searches; Support vector machines; Testing; Training data;
Conference_Titel :
Communication, Control, and Computing, 2009. Allerton 2009. 47th Annual Allerton Conference on
Conference_Location :
Monticello, IL
Print_ISBN :
978-1-4244-5870-7
DOI :
10.1109/ALLERTON.2009.5394501