• DocumentCode
    3540705
  • Title

    New statistic in P-value estimation for anomaly detection

  • Author

    Qian, Jing ; Saligrama, Venkatesh

  • Author_Institution
    Boston Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    5-8 Aug. 2012
  • Firstpage
    393
  • Lastpage
    396
  • Abstract
    Given n nominal samples, a query point η and a significance level a, the uniformly most powerful test for anomaly detection can be to test p(η) ≤ α, where p(η) is the p-value function of η. In [1] a p-value estimator is proposed which is based on ranking some statistic over all data samples, and is shown to be asymptotically consistent. Relying on this framework we propose a new statistic for p-value estimation. It is based on the average of K nearest neighbor (K-NN) distances of η within a K-NN graph constructed from n nominal training samples. We also provide a bootstrapping strategy for estimating p-values which leads to better robustness. We then theoretically justify the asymptotic consistency of our ideas through a finite sample analysis. Synthetic and real experiments demonstrate the superiorities of our scheme.
  • Keywords
    estimation theory; graph theory; learning (artificial intelligence); pattern classification; sampling methods; security of data; K nearest neighbor distance; K-NN distance; K-NN graph; anomaly detection; asymptotic consistency; bootstrapping strategy; finite sample analysis; p-value estimation; p-value function; query point; statistics; Convergence; Estimation; Machine learning; Robustness; Support vector machines; Testing; Training; Anomaly Detection; k-NN graph; p-value;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing Workshop (SSP), 2012 IEEE
  • Conference_Location
    Ann Arbor, MI
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-0182-4
  • Electronic_ISBN
    pending
  • Type

    conf

  • DOI
    10.1109/SSP.2012.6319713
  • Filename
    6319713