DocumentCode
656151
Title
Engineering High-Performance Community Detection Heuristics for Massive Graphs
Author
Staudt, Christian L. ; Meyerhenke, Henning
Author_Institution
Inst. of Theor. Inf., KarInstitute of Theoretical Informaticslsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
fYear
2013
fDate
1-4 Oct. 2013
Firstpage
180
Lastpage
189
Abstract
The amount of graph-structured data has recently experienced an enormous growth in many applications. To transform such data into useful information, high-performance analytics algorithms and software tools are necessary. One common graph analytics kernel is community detection (or graph clustering). Despite extensive research on heuristic solvers for this task, only few parallel codes exist, although parallelism is often necessary to scale to the data volume of real-world applications. We address the deficit in computing capability by a flexible and extensible clustering algorithm framework with shared-memory parallelism. Within this framework we implement our parallel variations of known sequential algorithms and combine them by an ensemble approach. In extensive experiments driven by the algorithm engineering paradigm, we identify the most successful parameters and combinations of these algorithms. The processing rate of our fastest algorithm exceeds 10M edges/second for many large graphs, making it suitable for massive data streams. Moreover, the strongest algorithm we developed yields a very good tradeoff between quality and speed.
Keywords
data handling; graph theory; parallel processing; pattern clustering; performance evaluation; data volume; engineering high-performance community detection heuristics; graph analytics kernel; graph clustering; graph structured data; massive graphs; parallel codes; parallel variations; real-world applications; shared-memory parallelism; software tools; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Communities; Graphics processing units; Linear programming; Parallel processing; Community detection; graph clustering; high-performance network analysis; parallel algorithm engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location
Lyon
ISSN
0190-3918
Type
conf
DOI
10.1109/ICPP.2013.27
Filename
6687351
Link To Document