DocumentCode :
140740
Title :
Incremental cluster evolution tracking from highly dynamic network data
Author :
Lee, Pei ; Lakshmanan, Laks V. S. ; Milios, Evangelos E.
Author_Institution :
Comput. Sci. Dept., Univ. of British Columbia, Vancouver, BC, Canada
fYear :
2014
fDate :
March 31 - April 4, 2014
Firstpage :
3
Lastpage :
14
Abstract :
Dynamic networks are commonly found in the current web age. In scenarios like social networks and social media, dynamic networks are noisy, large-scale, and quickly evolving. In this paper, we focus on the cluster evolution tracking problem on highly dynamic networks, with clear application to event evolution tracking. There are several previous works on data stream clustering using a node-by-node approach for maintaining clusters. However, handling of bulk updates, i.e., a subgraph at a time, is critical for achieving acceptable performance over very large highly dynamic networks. We propose a subgraph-by-subgraph incremental tracking framework for cluster evolution in this paper. To effectively illustrate the techniques in our framework, we consider the event evolution tracking task in social streams as an application, where a social stream and an event are modeled as a dynamic post network and a dynamic cluster respectively. By monitoring through a fading time window, we introduce a skeletal graph to summarize the information in the dynamic network, and formalize cluster evolution patterns using a group of primitive evolution operations and their algebra. Two incremental computation algorithms are developed to maintain clusters and track evolution patterns as time rolls on and the network evolves. Our detailed experimental evaluation on large Twitter datasets demonstrates that our framework can effectively track the complete set of cluster evolution patterns from highly dynamic networks on the fly.
Keywords :
algebra; evolutionary computation; learning (artificial intelligence); network theory (graphs); pattern clustering; social networking (online); Twitter datasets; bulk update handling; cluster evolution patterns; cluster evolution tracking problem; cluster maintenance; data stream clustering; dynamic networks; event evolution tracking; fading time window; incremental cluster evolution tracking; network data; node-by-node approach; primitive evolution operations; skeletal graph; social media; social networks; social stream; subgraph-by-subgraph incremental tracking framework; Clustering algorithms; Fading; Heuristic algorithms; Monitoring; Noise; Robustness; Twitter
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
Type :
conf
DOI :
10.1109/ICDE.2014.6816635
Filename :
6816635