• DocumentCode
    34268
  • Title

    Scalable Diversified Ranking on Large Graphs

  • Author

    Rong-Hua Li ; Yu, Jeffrey Xu

  • Author_Institution
    Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Hong Kong, China
  • Volume
    25
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    2133
  • Lastpage
    2146
  • Abstract
    Enhancing diversity in ranking on graphs has been identified as an important retrieval and mining task. Nevertheless, many existing diversified ranking algorithms either cannot be scalable to large graphs due to the time or memory requirements, or lack an intuitive and reasonable diversified ranking measure. In this paper, we propose a new diversified ranking measure on large graphs, which captures both relevance and diversity, and formulate the diversified ranking problem as a submodular set function maximization problem. Based on the submodularity of the proposed measure, we develop an efficient greedy algorithm with linear time and space complexity w.r.t. the size of the graph to achieve near-optimal diversified ranking. In addition, we present a generalized diversified ranking measure and give a near-optimal randomized greedy algorithm with linear time and space complexity for optimizing it. We evaluate the proposed methods through extensive experiments on five real data sets. The experimental results demonstrate the effectiveness and efficiency of the proposed algorithms.
  • Keywords
    computational complexity; data mining; graph theory; greedy algorithms; information retrieval; optimisation; randomised algorithms; data mining task; diversified ranking algorithms; diversified ranking measure; generalized diversified ranking; information retrieval; linear time complexity; memory requirements; near-optimal diversified ranking; near-optimal randomized greedy algorithm; scalable diversified ranking; space complexity; submodular set function maximization problem; submodularity; Algorithm design and analysis; Approximation algorithms; Binary trees; Complexity theory; Diversity reception; Greedy algorithms; Vectors; Diversified ranking; Flajolet-Martin sketch; graph algorithms; scalability; submodular function;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.170
  • Filename
    6276206