Title :
HBGSim: A structural similarity measurement over heterogeneous big graphs
Author :
Jiazhen Nian ; Shan Jiang ; Yan Zhang
Author_Institution :
Dept. of Machine Intell., Peking Univ., Beijing, China
Abstract :
Similarity measurement is fundamental to many data mining and information retrieval tasks such as link prediction and relevance-based search. Conventional similarity measurement relies more on homogenous linkage relation and content information. However, these measurements cannot take full advantage of the data structure as heterogenous graph gains increasing popularity. Moreover, the scalability of these methods also faces challenge with the never-ending growth of big data in real world. In this paper, we propose a new similarity measurement called HBGSim based on the heterogeneous structured data. HBGSim combines both local and global features by a two-stage process. We make a comparison between our measurement and some traditional methods on DBLP1 dataset for evaluation and the experimental results show that our method outperforms the others.
Keywords :
data structures; graph theory; pattern matching; DBLP dataset; HBGSim; heterogeneous big graphs; heterogeneous structured data; structural similarity measurement; Avatars; Data mining; Feature extraction; Nonhomogeneous media; Partitioning algorithms; Semantics; Vectors; Heterogeneous graph; Heterogeneous neighbors; Structural similarity measurement;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004465