• DocumentCode
    575000
  • Title

    Distributed parallel adaptive clustering algorithm based on Clique and high dimensionality reduction

  • Author

    LinJia Qin

  • Author_Institution
    Shanghai Univ., Shanghai, China
  • fYear
    2011
  • fDate
    Nov. 29 2011-Dec. 1 2011
  • Firstpage
    352
  • Lastpage
    357
  • Abstract
    With the fast development of storage technologies, large-scale and high dimensional datasets are stored in a distributed way. It usually applies distributed clustering algorithms to cluster distributed datasets. This paper presents a distributed clustering algorithm based on Clique and high dimensionality reduction to do the distributed clustering. Moreover, the efficiency, accuracy and extendibility of clustering analysis are improved by self-adapting algorithms and the assistant of data and mission parallelism in master or child node. Through experiments, we show that DPA-CLIQU efficiently finds accurate clusters in large high dimensional datasets from a distributed system.
  • Keywords
    parallel programming; pattern clustering; storage management; Clique; distributed parallel adaptive clustering; high dimensional datasets; high dimensionality reduction; large-scale datasets; storage technologies; Algorithm design and analysis; Clustering algorithms; Correlation; Data systems; Distributed databases; Parallel processing; Partitioning algorithms; Clique; Clustering analysis; Data mining; Distributed Clustering; Distributed data system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Sciences and Convergence Information Technology (ICCIT), 2011 6th International Conference on
  • Conference_Location
    Seogwipo
  • Print_ISBN
    978-1-4577-0472-7
  • Type

    conf

  • Filename
    6316636