Title :
Mining Query-Based Subnetwork Outliers in Heterogeneous Information Networks
Author :
Honglei Zhuang ; Jing Zhang ; Brova, George ; Jie Tang ; Hasan Cam ; Xifeng Yan ; Jiawei Han
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
Mining outliers in a heterogeneous information network is a challenging problem: It is even unclear what should be outliers in a large heterogeneous network (e.g., Outliers in the entire bibliographic network consisting of authors, titles, papers and venues). In this study, we propose an interesting class of outliers, query-based sub network outliers: Given a heterogeneous network, a user raises a query to retrieve a set of task-relevant sub networks, among which, sub network outliers are those that significantly deviate from others (e.g., Outliers of author groups among those studying "topic modeling"). We formalize this problem and propose a general framework, where one can query for finding sub network outliers with respect to different semantics. We introduce the notion of sub network similarity that captures the proximity between two sub networks by their membership distributions. We propose an outlier detection algorithm to rank all the sub networks according to their outlierness without tuning parameters. Our quantitative and qualitative experiments on both synthetic and real data sets show that the proposed method outperforms other baselines.
Keywords :
bibliographic systems; data mining; information networks; query processing; bibliographic network; heterogeneous information networks; membership distributions; outlier detection algorithm; query-based subnetwork outlier mining; subnetwork similarity; task-relevant subnetworks; topic modeling; tuning parameters; Computer science; Data mining; Linear programming; Mathematical model; Patents; Silicon; Vectors; heterogeneous information network; outlier detection; query-based;
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4303-6
DOI :
10.1109/ICDM.2014.85