• DocumentCode
    477803
  • Title

    Discovering the Skyline of Subspace Clusters in High-Dimensional Data

  • Author

    Chen, Guanhua ; Ma, Xiuli ; Yang, Dongqing ; Tang, Shiwei

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing
  • Volume
    2
  • fYear
    2008
  • fDate
    18-20 Oct. 2008
  • Firstpage
    439
  • Lastpage
    443
  • Abstract
    Subspace clustering on high-dimensional datasets may often result in an undesirably large set of clusters due to the huge amount of possible subspaces. Such a large set of subspace clusters not only raises the cost of computation, but also weaken the understandability of the results. Both of the two problems reduce the usability of the subspace clustering in the real applications. In this paper, we propose a new approach of applying skyline query into the subspace clustering process, for avoiding redundant subspace clusters by the dominating relationship, which is characterized as mining the skyline of subspace clusters. Two algorithms, SkyClu-CBC and SkyClu-IBC, are proposed. Experiments on real and synthetic datasets are carried out to show the effectiveness and efficiency of the proposed methods.
  • Keywords
    data mining; pattern clustering; query formulation; SkyClu-CBC; SkyClu-IBC; data mining; high-dimensional data; skyline query; subspace clusters; Clustering algorithms; Computational efficiency; Computer science; Computer science education; Consumer electronics; Data engineering; Fuzzy systems; Knowledge engineering; Laboratories; Systems engineering education; high-dimensional data; skyline; subspace clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
  • Conference_Location
    Shandong
  • Print_ISBN
    978-0-7695-3305-6
  • Type

    conf

  • DOI
    10.1109/FSKD.2008.489
  • Filename
    4666155