• DocumentCode
    11376
  • Title

    Surfing the Network for Ranking by Multidamping

  • Author

    Kollias, Giorgos ; Gallopoulos, Efstratios ; Grama, A.

  • Author_Institution
    Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
  • Volume
    26
  • Issue
    9
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    2323
  • Lastpage
    2336
  • Abstract
    PageRank is one of the most commonly used techniques for ranking nodes in a network. It is a special case of a family of link-based rankings, commonly referred to as functional rankings. Functional rankings are computed as power series of a stochastic matrix derived from the adjacency matrix of the graph. This general formulation of functional rankings enables their use in diverse applications, ranging from traditional search applications to identification of spam and outliers in networks. This paper presents a novel algorithmic (re)formulation of commonly used functional rankings, such as LinearRank, TotalRank and Generalized Hyperbolic Rank. These rankings can be approximated by finite series representations. We prove that polynomials of stochastic matrices can be expressed as products of Google matrices (matrices having the form used in Google´s original PageRank formulation). Individual matrices in these products are parameterized by different damping factors. For this reason, we refer to our formulation as multidamping. We demonstrate that multidamping has a number of desirable characteristics: (i) for problems such as finding the highest ranked pages, multidamping admits extremely fast approximate solutions; (ii) multidamping provides an intuitive interpretation of existing functional rankings in terms of the surfing habits of model web users; (iii) multidamping provides a natural framework based on Monte Carlo type methods that have efficient parallel and distributed implementations. It also provides the basis for constructing new link-based rankings based on inhomogeneous products of Google matrices. We present algorithms for computing damping factors for existing functional rankings analytically and numerically. We validate various benefits of multidamping on a number of real datasets.
  • Keywords
    Internet; Monte Carlo methods; data mining; matrix algebra; polynomial matrices; stochastic processes; Google matrices; LinearRank; Monte Carlo type methods; PageRank; TotalRank; Web mining methods; adjacency matrix; damping factors; finite series representations; functional rankings; generalized hyperbolic rank; inhomogeneous products; link-based rankings; multidamping process; node ranking; outlier identification; spam identification; stochastic polynomial matrix; Approximation methods; Damping; Google; Markov processes; Nonhomogeneous media; Vectors; Computing Methodologies; Database Applications; Database Management; Discrete Mathematics; Graph Theory; Graph algorithms; Information Technology and Systems; Markov processes; Mathematics of Computing; Mining methods and algorithms; Modeling; Monte Carlo; Personalization; Probability and Statistics; Ranking; Simulation; Types of Simulation; Web mining; and Visualization;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2013.15
  • Filename
    6412669