• DocumentCode
    20889
  • Title

    Static and Dynamic Structural Correlations in Graphs

  • Author

    Jian Wu ; Ziyu Guan ; Qing Zhang ; Singh, A.K. ; Xifeng Yan

  • Author_Institution
    Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
  • Volume
    25
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    2147
  • Lastpage
    2160
  • Abstract
    Real-life graphs not only contain nodes and edges, but also have events taking place, e.g., product sales in social networks. Among different events, some exhibit strong correlations with the network structure, while others do not. Such structural correlations will shed light on viral influence existing in the corresponding network. Unfortunately, the traditional association mining concept is not applicable in graphs because it only works on homogeneous data sets like transactions and baskets. We propose a novel measure for assessing such structural correlations in heterogeneous graph data sets with events. The measure applies hitting time to aggregate the proximity among nodes that have the same event. To calculate the correlation scores for many events in a large network, we develop a scalable framework, called gScore, using sampling and approximation. By comparing to the situation where events are randomly distributed in the same network, our method is able to discover events that are highly correlated with the graph structure. We test gScore´s effectiveness by synthetic events on the DBLP coauthor network and report interesting correlation results in a social network extracted from TaoBao.com, the largest online shopping network in China. Scalability of gScore is tested on the Twitter network. Since an event is essentially a temporal phenomenon, we also propose a dynamic measure, which reveals structural correlations at specific time steps and can be used for discovering detailed evolutionary patterns.
  • Keywords
    Internet; approximation theory; correlation methods; data handling; graph theory; network theory (graphs); random processes; retail data processing; sampling methods; social networking (online); China; DBLP coauthor network; TaoBao.com; Twitter network; approximation method; dynamic measure; dynamic structural correlation; evolutionary patterns; gScore; graph structure; heterogeneous graph data sets; hitting time; network structure; online shopping network; random event distribution; sampling method; scalable framework; social network; static structural correlation; structural correlation score; synthetic events; temporal phenomenon; Approximation methods; Correlation; Electronic mail; Marketing and sales; Q measurement; Social network services; Time measurement; Graph; hitting time; structural correlation;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.133
  • Filename
    6226407