• DocumentCode
    2439254
  • Title

    Attack-resistant frequency counting

  • Author

    Wu, Bo ; Saia, Jared ; King, Valerie

  • Author_Institution
    Dept. of Comput. Sci., Univ. of New Mexico, Albuquerque, NM, USA
  • fYear
    2010
  • fDate
    19-23 April 2010
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    We present collaborative peer-to-peer algorithms for the problem of approximating frequency counts for popular items distributed across the peers of a large-scale network. Our algorithms are attack-resistant in the sense that they function correctly even in the case where an adaptive and computationally unbounded adversary causes up to a 1/3 fraction of the peers in the network to suffer Byzantine faults. Our algorithms are scalable in the sense that all resource costs are polylogarithmic. Specifically, latency is O(log n); the number of messages and number of bits sent and received by each peer is O(log2n) per item; and number of neighbors of each peer is O(log2n). Our motivation for addressing this problem is to provide a tool for the following three applications: worm and virus detection; spam detection; and distributed data-mining. To the best of our knowledge, our algorithms are the first attack-resistant and scalable algorithms for this problem. Moreover, surprisingly, our algorithms seem to be the first attack-resistant algorithms for any data mining problem.
  • Keywords
    groupware; large-scale systems; peer-to-peer computing; security of data; Byzantine faults; attack resistant frequency counting; collaborative peer-to-peer algorithms; distributed data mining; large scale network; spam detection; virus detection; worm; Collaborative work; Computer networks; Computer science; Computer worms; Data mining; Fingerprint recognition; Frequency; Large-scale systems; Payloads; Peer to peer computing; Byzantine faults; butterfly; distributed; frequent items;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-6442-5
  • Type

    conf

  • DOI
    10.1109/IPDPS.2010.5470344
  • Filename
    5470344