DocumentCode :
610389
Title :
gIceberg: Towards iceberg analysis in large graphs
Author :
Nan Li ; Ziyu Guan ; Lijie Ren ; Jian Wu ; Jiawei Han ; Xifeng Yan
Author_Institution :
Dept. of Comput. Sci., Univ. of California at Santa Barbara, Santa Barbara, CA, USA
fYear :
2013
fDate :
8-12 April 2013
Firstpage :
1021
Lastpage :
1032
Abstract :
Traditional multi-dimensional data analysis techniques such as iceberg cube cannot be directly applied to graphs for finding interesting or anomalous vertices due to the lack of dimensionality in graphs. In this paper, we introduce the concept of graph icebergs that refer to vertices for which the concentration (aggregation) of an attribute in their vicinities is abnormally high. Intuitively, these vertices shall be “close” to the attribute of interest in the graph space. Based on this intuition, we propose a novel framework, called gIceberg, which performs aggregation using random walks, rather than traditional SUM and AVG aggregate functions. This proposed framework scores vertices by their different levels of interestingness and finds important vertices that meet a user-specified threshold. To improve scalability, two aggregation strategies, forward and backward aggregation, are proposed with corresponding optimization techniques and bounds. Experiments on both real-world and synthetic large graphs demonstrate that gIceberg is effective and scalable.
Keywords :
data analysis; graph theory; optimisation; attribute aggregation; attribute concentration; backward aggregation strategy; forward aggregation strategy; gIceberg framework; graph dimensionality; graph icebergs concept; graph vertex; iceberg analysis; iceberg cube; multidimensional data analysis technique; optimization technique; random walk; user-specified threshold; Accuracy; Aggregates; Approximation algorithms; Approximation methods; Educational institutions; Equations; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
ISSN :
1063-6382
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2013.6544894
Filename :
6544894
Link To Document :
بازگشت