• DocumentCode
    2371115
  • Title

    Frequent-pattern based iterative projected clustering

  • Author

    Yiu, Man Lung ; Mamoulis, Nikos

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    689
  • Lastpage
    692
  • Abstract
    Irrelevant attributes add noise to high dimensional clusters and make traditional clustering techniques inappropriate. Projected clustering algorithms have been proposed to find the clusters in hidden subspaces. We realize the analogy between mining frequent itemsets and discovering the relevant subspace for a given cluster. We propose a methodology for finding projected clusters by mining frequent itemsets and present heuristics that improve its quality. Our techniques are evaluated with synthetic and real data; they are scalable and discover projected clusters accurately.
  • Keywords
    data mining; pattern clustering; statistical analysis; frequent itemset mining; hidden subspace; projected cluster discovery; projected clustering algorithm; real data; synthetic data; Character generation; Clustering algorithms; Computer science; Data mining; Databases; Information systems; Itemsets; Iterative algorithms; Lungs; Partitioning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1251009
  • Filename
    1251009