• DocumentCode
    907895
  • Title

    A human-computer interactive method for projected clustering

  • Author

    Aggarwal, Charu C.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
  • Volume
    16
  • Issue
    4
  • fYear
    2004
  • fDate
    4/1/2004 12:00:00 AM
  • Firstpage
    448
  • Lastpage
    460
  • Abstract
    Clustering is a central task in data mining applications such as customer segmentation. High-dimensional data has always been a challenge for clustering algorithms because of the inherent sparsity of the points. Therefore, techniques have recently been proposed to find clusters in hidden subspaces of the data. However, since the behavior of the data can vary considerably in different subspaces, it is often difficult to define the notion of a cluster with the use of simple mathematical formalizations. The widely used practice of treating clustering as the exact problem of optimizing an arbitrarily chosen objective function can often lead to misleading results. In fact, the proper clustering definition may vary not only with the application and data set but also with the perceptions of the end user. This makes it difficult to separate the definition of the clustering problem from the perception of an end-user. We propose a system, which performs high-dimensional clustering by cooperation between the human and the computer. The complex task of cluster creation is accomplished through a combination of human intuition and the computational support provided by the computer. The result is a system, which leverages the best abilities of both the human and the computer for solving the clustering problem.
  • Keywords
    cooperative systems; data mining; human computer interaction; pattern clustering; very large databases; computational support; end-user perception; high-dimensional data mining; human intuition; human-computer interactive method; projected clustering algorithms; Application software; Clustering algorithms; Data mining; Helium; High performance computing; Humans; Multidimensional systems; Pattern analysis; Pattern recognition; Shape;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2004.1269669
  • Filename
    1269669