DocumentCode
588704
Title
A Label-Based Partitioning Strategy for Mining Link Patterns
Author
Cuifang Zhao ; Xiang Zhang ; Peng Wang
Author_Institution
Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
fYear
2012
fDate
8-10 Nov. 2012
Firstpage
203
Lastpage
206
Abstract
As the explosive growth of online linked data, the task of mining link patterns attracts more and more attention. A practical issue is how to perform mining efficiently in large-scale linked data. Existing pattern mining algorithms usually assume that the dataset can fit into the main memory, while linked data of billion triples is far beyond the memory limitation. In this paper we give a pilot study of a novel partitioning strategy for mining link patterns in large-scale linked data. First, we propose an algorithm named Par Group to divide and group large linked data to partitions based on vertex label, Second, an adapted gSpan is applied for mining link patterns in each partition, At last, discovered link patterns are merged into a global result set. Experiments show that our strategy is feasible and promising in some scenarios.
Keywords
data mining; graph theory; merging; pattern clustering; ParGroup algorithm; gSpan; label-based partitioning strategy; large linked data group; large-scale linked data mining; link pattern mining; online linked data; pattern merging; pattern mining algorithms; vertex label; Algorithm design and analysis; Data mining; Databases; Educational institutions; Merging; Partitioning algorithms; Resource description framework; Graph Clustering; Graph Partition; Pattern Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Knowledge, Information and Creativity Support Systems (KICSS), 2012 Seventh International Conference on
Conference_Location
Melbourne, VIC
Print_ISBN
978-1-4673-4564-4
Type
conf
DOI
10.1109/KICSS.2012.15
Filename
6405530
Link To Document