• DocumentCode
    2423471
  • Title

    A Fast Subspace Clustering Algorithm Based on Pattern Similarity

  • Author

    Gan, Yanglan ; Guan, Jihong ; Wang, Hao

  • Author_Institution
    Tongji Univ., Shanghai
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    253
  • Lastpage
    257
  • Abstract
    Traditional clustering models define similarity by distance over dimensions. However, distance functions are not always adequate in capturing correlations among the objects. Pattern-based clustering can discover this kind of clusters. But state-of-the-art pattern-based clustering methods are inefficient and haven´t criteria to evaluate the quality of clusters. This paper presents a novel pattern similarity-based subspace clustering with the pattern tree (PPSC for short) that offers these capabilities. The method uses new evaluation criteria to discover best clusters, which enables user to find clusters according to different needs. Meanwhile, observing the analogy between mining frequent itemsets and discovering subspace clusters around random points, we apply the pattern-tree to determine subspace by scanning the database once, so it can perform efficiently in large datasets.
  • Keywords
    pattern clustering; set theory; pattern similarity; pattern-based clustering; random points; similarity-based subspace clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Databases; Fuzzy systems; Gallium nitride; Itemsets; Large-scale systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.24
  • Filename
    4406239