DocumentCode :
82467
Title :
The Sum-over-Forests Density Index: Identifying Dense Regions in a Graph
Author :
Senelle, Mathieu ; Garcia-Diez, Silvia ; Mantrach, Amin ; Shimbo, Masashi ; Saerens, Marco ; Fouss, Francois
Author_Institution :
Louvain Sch. of Manage. (LSM), Univ. catholique de Louvain (UCL), Mons, Belgium
Volume :
36
Issue :
6
fYear :
2014
fDate :
Jun-14
Firstpage :
1268
Lastpage :
1274
Abstract :
This work introduces a novel nonparametric density index defined on graphs, the Sum-over-Forests (SoF) density index. It is based on a clear and intuitive idea: high-density regions in a graph are characterized by the fact that they contain a large amount of low-cost trees with high outdegrees while low-density regions contain few ones. Therefore, a Boltzmann probability distribution on the countable set of forests in the graph is defined so that large (high-cost) forests occur with a low probability while short (low-cost) forests occur with a high probability. Then, the SoF density index of a node is defined as the expected outdegree of this node on the set of forests, thus providing a measure of density around that node. Following the matrix-forest theorem and a statistical physics framework, it is shown that the SoF density index can be easily computed in closed form through a simple matrix inversion. Experiments on artificial and real datasets show that the proposed index performs well on finding dense regions, for graphs of various origins.
Keywords :
matrix inversion; nonparametric statistics; statistical distributions; trees (mathematics); Boltzmann probability distribution; SoF density index; dense region identification; high-density regions; low-cost trees; matrix inversion; matrix-forest theorem; nonparametric density index; statistical physics framework; sum-over-forests density index; Correlation; Equations; Indexes; Physics; Probability distribution; Vegetation; Data mining; Discrete Mathematics; Graph Theory; Graph mining; Mathematics of Computing; Mining methods and algorithms; Trees; dense regions on graphs; density index; matrix-forest theorem;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2013.227
Filename :
6656802
Link To Document :
بازگشت