DocumentCode :
3540705
Title :
New statistic in P-value estimation for anomaly detection
Author :
Qian, Jing ; Saligrama, Venkatesh
Author_Institution :
Boston Univ., Boston, MA, USA
fYear :
2012
fDate :
5-8 Aug. 2012
Firstpage :
393
Lastpage :
396
Abstract :
Given n nominal samples, a query point η and a significance level a, the uniformly most powerful test for anomaly detection can be to test p(η) ≤ α, where p(η) is the p-value function of η. In [1] a p-value estimator is proposed which is based on ranking some statistic over all data samples, and is shown to be asymptotically consistent. Relying on this framework we propose a new statistic for p-value estimation. It is based on the average of K nearest neighbor (K-NN) distances of η within a K-NN graph constructed from n nominal training samples. We also provide a bootstrapping strategy for estimating p-values which leads to better robustness. We then theoretically justify the asymptotic consistency of our ideas through a finite sample analysis. Synthetic and real experiments demonstrate the superiorities of our scheme.
Keywords :
estimation theory; graph theory; learning (artificial intelligence); pattern classification; sampling methods; security of data; K nearest neighbor distance; K-NN distance; K-NN graph; anomaly detection; asymptotic consistency; bootstrapping strategy; finite sample analysis; p-value estimation; p-value function; query point; statistics; Convergence; Estimation; Machine learning; Robustness; Support vector machines; Testing; Training; Anomaly Detection; k-NN graph; p-value;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Statistical Signal Processing Workshop (SSP), 2012 IEEE
Conference_Location :
Ann Arbor, MI
ISSN :
pending
Print_ISBN :
978-1-4673-0182-4
Electronic_ISBN :
pending
Type :
conf
DOI :
10.1109/SSP.2012.6319713
Filename :
6319713
Link To Document :
بازگشت