Title :
Engineering High-Performance Community Detection Heuristics for Massive Graphs
Author :
Staudt, Christian L. ; Meyerhenke, Henning
Author_Institution :
Inst. of Theor. Inf., KarInstitute of Theoretical Informaticslsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
Abstract :
The amount of graph-structured data has recently experienced an enormous growth in many applications. To transform such data into useful information, high-performance analytics algorithms and software tools are necessary. One common graph analytics kernel is community detection (or graph clustering). Despite extensive research on heuristic solvers for this task, only few parallel codes exist, although parallelism is often necessary to scale to the data volume of real-world applications. We address the deficit in computing capability by a flexible and extensible clustering algorithm framework with shared-memory parallelism. Within this framework we implement our parallel variations of known sequential algorithms and combine them by an ensemble approach. In extensive experiments driven by the algorithm engineering paradigm, we identify the most successful parameters and combinations of these algorithms. The processing rate of our fastest algorithm exceeds 10M edges/second for many large graphs, making it suitable for massive data streams. Moreover, the strongest algorithm we developed yields a very good tradeoff between quality and speed.
Keywords :
data handling; graph theory; parallel processing; pattern clustering; performance evaluation; data volume; engineering high-performance community detection heuristics; graph analytics kernel; graph clustering; graph structured data; massive graphs; parallel codes; parallel variations; real-world applications; shared-memory parallelism; software tools; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Communities; Graphics processing units; Linear programming; Parallel processing; Community detection; graph clustering; high-performance network analysis; parallel algorithm engineering;
Conference_Titel :
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location :
Lyon
DOI :
10.1109/ICPP.2013.27