• DocumentCode
    710107
  • Title

    Diversified top-k clique search

  • Author

    Long Yuan ; Lu Qin ; Xuemin Lin ; Lijun Chang ; Wenjie Zhang

  • Author_Institution
    Univ. of New South Wales, Sydney, NSW, Australia
  • fYear
    2015
  • fDate
    13-17 April 2015
  • Firstpage
    387
  • Lastpage
    398
  • Abstract
    Maximal clique enumeration is a fundamental problem in graph theory and has been extensively studied. However, maximal clique enumeration is time-consuming in large graphs and always returns enormous cliques with large overlaps. Motivated by this, in this paper, we study the diversified top-k clique search problem which is to find top-k maximal cliques that can cover most number of nodes in the graph. Diversified top-k clique search can be widely used in a lot of applications including community search, motif discovery, and anomaly detection in large graphs. A naive solution for diversified top-k clique search is to keep all maximal cliques in memory and then find k of them that cover most nodes in the graph by using the approximate greedy max k-cover algorithm. However, such a solution is impractical when the graph is large. In this paper, instead of keeping all maximal cliques in memory, we devise an algorithm to maintain k candidates in the process of maximal clique enumeration. Our algorithm has limited memory footprint and can achieve a guaranteed approximation ratio. We also introduce a novel light-weight PNP-Index, based on which we design an optimal maximal clique maintenance algorithm. We further explore three optimization strategies to avoid enumerating all maximal cliques and thus largely reduce the computational cost. We conduct extensive performance studies on six real graphs one of which contains 0.3 billion edges, and the results demonstrate the high efficiency and effectiveness of our approach.
  • Keywords
    approximation theory; graph theory; search problems; anomaly detection; approximate greedy max k-cover algorithm; community search; diversified top-k clique search problem; graph nodes; graph theory; maximal clique enumeration; motif discovery; optimal maximal clique maintenance algorithm; top-k maximal cliques; Algorithm design and analysis; Approximation algorithms; Approximation methods; Communities; Computational efficiency; Optimization; Search problems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2015 IEEE 31st International Conference on
  • Conference_Location
    Seoul
  • Type

    conf

  • DOI
    10.1109/ICDE.2015.7113300
  • Filename
    7113300