Author_Institution :
Dept. of Inf. Syst., Univ. of Haifa, Haifa, Israel
Abstract :
Local thresholding algorithms were first presented more than a decade ago and have since been applied to a variety of data mining tasks in peer-to-peer systems, wireless sensor networks, and in grid systems. One critical assumption made by those algorithms has always been cycle-free routing. The existence of even one cycle may lead all peers to the wrong outcome. Outside the lab, unfortunately, cycle freedom is not easy to achieve. This work is the first to lift the requirement of cycle freedom by presenting a local thresholding algorithm suitable for general network graphs. The algorithm relies on a new repositioning of the problem in weighted vector arithmetics, on a new stopping rule, whose proof does not require that the network be cycle free, and on new methods for balance correction when the stopping rule fails. The new stopping and update rules permit calculation of the very same functions that were calculable using previous algorithms, which do assume cycle freedom. The algorithm is implemented on a standard peer-to-peer simulator and is validated for networks of up to 80,000 peers, organized in three different topologies representative of major current distributed systems: the Internet, structured peer-to-peer systems, and wireless sensor networks.
Keywords :
Internet; data mining; graph theory; peer-to-peer computing; telecommunication network routing; telecommunication network topology; wireless sensor networks; Internet; balance correction; cycle freedom requirement; cycle-free routing; data mining tasks; distributed systems; general network graphs; local thresholding algorithms; standard peer-to-peer simulator; stopping rule; structured peer-to-peer systems; update rules; weighted vector arithmetics; wireless sensor networks; Approximation algorithms; Data mining; Nickel; Peer-to-peer computing; Silicon; Topology; Vectors; Distributed data mining; distributed convex thresholding; peer-to-peer; wireless sensor networks;