• DocumentCode
    1122006
  • Title

    Cluster Definition by the Optimization of Simple Measures

  • Author

    Bailey, Thomas ; Cowles, John

  • Author_Institution
    Department of Computer Science, University of Wyoming, Laramie, WY 82071.
  • Issue
    5
  • fYear
    1984
  • Firstpage
    645
  • Lastpage
    652
  • Abstract
    We adopt the following measures of clustering based on simple edge counts in an undirected loop-free graph. Let S be a subset of the points of the graph. The compactness of S is measured by the number of edges connecting points in S to other points in S. The isolation or separation of S is measured by the negative of the number of edges connecting points in S to points not in S. The subset S is a cluster if it is compact and isolated. We study the cluster search problem: find a subset S which maximizes a linear combination of the compactness and (negative) isolation edge counts. We show that a closely related decision problem is NP-complete. We develop a pruned search tree algorithm which is much faster than complete search, especially for graphs which are derived from points embedded in a space of low dimensionality.
  • Keywords
    Clustering algorithms; Clustering methods; Computer science; Data analysis; Graph theory; Joining processes; NP-complete problem; Search problems; Tree graphs; Cluster definition; NP-complete problems; cluster validity; clustering; graph-theoretical clustering; intrinsic dimensionality; tree-search algorithms;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.1984.4767579
  • Filename
    4767579