DocumentCode :
11376
Title :
Surfing the Network for Ranking by Multidamping
Author :
Kollias, Giorgos ; Gallopoulos, Efstratios ; Grama, A.
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
Volume :
26
Issue :
9
fYear :
2014
fDate :
Sept. 2014
Firstpage :
2323
Lastpage :
2336
Abstract :
PageRank is one of the most commonly used techniques for ranking nodes in a network. It is a special case of a family of link-based rankings, commonly referred to as functional rankings. Functional rankings are computed as power series of a stochastic matrix derived from the adjacency matrix of the graph. This general formulation of functional rankings enables their use in diverse applications, ranging from traditional search applications to identification of spam and outliers in networks. This paper presents a novel algorithmic (re)formulation of commonly used functional rankings, such as LinearRank, TotalRank and Generalized Hyperbolic Rank. These rankings can be approximated by finite series representations. We prove that polynomials of stochastic matrices can be expressed as products of Google matrices (matrices having the form used in Google´s original PageRank formulation). Individual matrices in these products are parameterized by different damping factors. For this reason, we refer to our formulation as multidamping. We demonstrate that multidamping has a number of desirable characteristics: (i) for problems such as finding the highest ranked pages, multidamping admits extremely fast approximate solutions; (ii) multidamping provides an intuitive interpretation of existing functional rankings in terms of the surfing habits of model web users; (iii) multidamping provides a natural framework based on Monte Carlo type methods that have efficient parallel and distributed implementations. It also provides the basis for constructing new link-based rankings based on inhomogeneous products of Google matrices. We present algorithms for computing damping factors for existing functional rankings analytically and numerically. We validate various benefits of multidamping on a number of real datasets.
Keywords :
Internet; Monte Carlo methods; data mining; matrix algebra; polynomial matrices; stochastic processes; Google matrices; LinearRank; Monte Carlo type methods; PageRank; TotalRank; Web mining methods; adjacency matrix; damping factors; finite series representations; functional rankings; generalized hyperbolic rank; inhomogeneous products; link-based rankings; multidamping process; node ranking; outlier identification; spam identification; stochastic polynomial matrix; Approximation methods; Damping; Google; Markov processes; Nonhomogeneous media; Vectors; Computing Methodologies; Database Applications; Database Management; Discrete Mathematics; Graph Theory; Graph algorithms; Information Technology and Systems; Markov processes; Mathematics of Computing; Mining methods and algorithms; Modeling; Monte Carlo; Personalization; Probability and Statistics; Ranking; Simulation; Types of Simulation; Web mining; and Visualization;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2013.15
Filename :
6412669
Link To Document :
بازگشت