DocumentCode :
2080366
Title :
Discovery-driven graph summarization
Author :
Zhang, Ning ; Tian, Yuanyuan ; Patel, Jignesh M.
Author_Institution :
Comput. Sci. Dept., Univ. of Wisconsin-Madison, Madison, WI, USA
fYear :
2010
fDate :
1-6 March 2010
Firstpage :
880
Lastpage :
891
Abstract :
Large graph datasets are ubiquitous in many domains, including social networking and biology. Graph summarization techniques are crucial in such domains as they can assist in uncovering useful insights about the patterns hidden in the underlying data. One important type of graph summarization is to produce small and informative summaries based on user-selected node attributes and relationships, and allowing users to interactively drill-down or roll-up to navigate through summaries with different resolutions. However, two key components are missing from the previous work in this area that limit the use of this method in practice. First, the previous work only deals with categorical node attributes. Consequently, users have to manually bucketize numerical attributes based on domain knowledge, which is not always possible. Moreover, users often have to manually iterate through many resolutions of summaries to identify the most interesting ones. This paper addresses both these key issues to make the interactive graph summarization approach more useful in practice. We first present a method to automatically categorize numerical attributes values by exploiting the domain knowledge hidden inside the node attributes values and graph link structures. Furthermore, we propose an interestingness measure for graph summaries to point users to the potentially most insightful summaries. Using two real datasets, we demonstrate the effectiveness and efficiency of our techniques.
Keywords :
data handling; data mining; graph theory; pattern classification; categorical node attribute; graph link structures; graph summarization; Biology computing; Computer networks; Data mining; Databases; Facebook; Humans; Navigation; Pervasive computing; Social network services; Usability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-5445-7
Electronic_ISBN :
978-1-4244-5444-0
Type :
conf
DOI :
10.1109/ICDE.2010.5447830
Filename :
5447830
Link To Document :
بازگشت