• DocumentCode
    533148
  • Title

    Study of high dimensional clustering algorithm based on graph partition

  • Author

    Yuan Gao

  • Author_Institution
    Sch. of Electron. & Comput. Sci. & Technol., North Univ. of China, Taiyuan, China
  • Volume
    13
  • fYear
    2010
  • fDate
    22-24 Oct. 2010
  • Abstract
    In many clustering applications, the data sets are high-dimensional, sparse and binary, resulting to the failure of traditional algorithms in handling these data. In this paper, we present a new clustering algorithm based on graph partition for high-dimensional data, which, by defining the feature vector of attribute-value distribution and the similarity of attribution-value distribution, and creating a sequence of smaller and smaller coarse graphs from the original base graph. The smallest coarse graph is then partitioned using a spectral method, and this partition is propagated back through the hierarchy of graphs. Thus, the corresponding data items in each partition are highly related. The analysis demonstrates that this algorithm is effective in clustering knowledge discover.
  • Keywords
    data handling; data mining; graph theory; pattern clustering; attribute value distribution; data handling; graph partition; high dimensional clustering algorithm; knowledge discover; Algorithm design and analysis; Clustering algorithms; Computational modeling; Data models; Marketing and sales; Partitioning algorithms; Data mining; Graph partition; High-dimensional clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Application and System Modeling (ICCASM), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-7235-2
  • Electronic_ISBN
    978-1-4244-7237-6
  • Type

    conf

  • DOI
    10.1109/ICCASM.2010.5622890
  • Filename
    5622890