• DocumentCode
    588704
  • Title

    A Label-Based Partitioning Strategy for Mining Link Patterns

  • Author

    Cuifang Zhao ; Xiang Zhang ; Peng Wang

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
  • fYear
    2012
  • fDate
    8-10 Nov. 2012
  • Firstpage
    203
  • Lastpage
    206
  • Abstract
    As the explosive growth of online linked data, the task of mining link patterns attracts more and more attention. A practical issue is how to perform mining efficiently in large-scale linked data. Existing pattern mining algorithms usually assume that the dataset can fit into the main memory, while linked data of billion triples is far beyond the memory limitation. In this paper we give a pilot study of a novel partitioning strategy for mining link patterns in large-scale linked data. First, we propose an algorithm named Par Group to divide and group large linked data to partitions based on vertex label, Second, an adapted gSpan is applied for mining link patterns in each partition, At last, discovered link patterns are merged into a global result set. Experiments show that our strategy is feasible and promising in some scenarios.
  • Keywords
    data mining; graph theory; merging; pattern clustering; ParGroup algorithm; gSpan; label-based partitioning strategy; large linked data group; large-scale linked data mining; link pattern mining; online linked data; pattern merging; pattern mining algorithms; vertex label; Algorithm design and analysis; Data mining; Databases; Educational institutions; Merging; Partitioning algorithms; Resource description framework; Graph Clustering; Graph Partition; Pattern Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge, Information and Creativity Support Systems (KICSS), 2012 Seventh International Conference on
  • Conference_Location
    Melbourne, VIC
  • Print_ISBN
    978-1-4673-4564-4
  • Type

    conf

  • DOI
    10.1109/KICSS.2012.15
  • Filename
    6405530