• DocumentCode
    1479074
  • Title

    Anomaly Detection Using Proximity Graph and PageRank Algorithm

  • Author

    Yao, Zhe ; Mark, Philip ; Rabbat, Michael

  • Author_Institution
    Electr. & Comput. Eng. Dept., McGill Univ., Montréal, QC, Canada
  • Volume
    7
  • Issue
    4
  • fYear
    2012
  • Firstpage
    1288
  • Lastpage
    1300
  • Abstract
    Anomaly detection techniques are widely used in a variety of applications, e.g., computer networks, security systems, etc. This paper describes and analyzes an approach to anomaly detection using proximity graphs and the PageRank algorithm. We run a variant of the PageRank algorithm on top of a proximity graph comprised of data points as vertices, which produces a score quantifying the extent to which each data point is anomalous. Previous work in this direction requires first forming a density estimate using the training data, e.g., using kernel methods, and this step is very computationally intensive for high-dimensional data sets. Under mild assumptions and appropriately chosen parameters, we show that PageRank produces point-wise consistent probability density estimates for the data points in an asymptotic sense, and with much less computational effort. As a result, big improvements in terms of running time are witnessed while maintaining similar detection performance. Experiments with synthetic and real-world data sets illustrate that the proposed approach is computationally tractable and scales well to large high-dimensional data sets.
  • Keywords
    graph theory; probability; search engines; security of data; PageRank algorithm; anomaly detection; data points; high-dimensional data sets; kernel method; point-wise consistent probability density estimates; proximity graph; Damping; Estimation; Image edge detection; Kernel; Testing; Training data; Vectors; Anomaly detection; personalized PageRank; proximity graph;
  • fLanguage
    English
  • Journal_Title
    Information Forensics and Security, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1556-6013
  • Type

    jour

  • DOI
    10.1109/TIFS.2012.2191963
  • Filename
    6175122