• DocumentCode
    656151
  • Title

    Engineering High-Performance Community Detection Heuristics for Massive Graphs

  • Author

    Staudt, Christian L. ; Meyerhenke, Henning

  • Author_Institution
    Inst. of Theor. Inf., KarInstitute of Theoretical Informaticslsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    180
  • Lastpage
    189
  • Abstract
    The amount of graph-structured data has recently experienced an enormous growth in many applications. To transform such data into useful information, high-performance analytics algorithms and software tools are necessary. One common graph analytics kernel is community detection (or graph clustering). Despite extensive research on heuristic solvers for this task, only few parallel codes exist, although parallelism is often necessary to scale to the data volume of real-world applications. We address the deficit in computing capability by a flexible and extensible clustering algorithm framework with shared-memory parallelism. Within this framework we implement our parallel variations of known sequential algorithms and combine them by an ensemble approach. In extensive experiments driven by the algorithm engineering paradigm, we identify the most successful parameters and combinations of these algorithms. The processing rate of our fastest algorithm exceeds 10M edges/second for many large graphs, making it suitable for massive data streams. Moreover, the strongest algorithm we developed yields a very good tradeoff between quality and speed.
  • Keywords
    data handling; graph theory; parallel processing; pattern clustering; performance evaluation; data volume; engineering high-performance community detection heuristics; graph analytics kernel; graph clustering; graph structured data; massive graphs; parallel codes; parallel variations; real-world applications; shared-memory parallelism; software tools; Algorithm design and analysis; Benchmark testing; Clustering algorithms; Communities; Graphics processing units; Linear programming; Parallel processing; Community detection; graph clustering; high-performance network analysis; parallel algorithm engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2013 42nd International Conference on
  • Conference_Location
    Lyon
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2013.27
  • Filename
    6687351