Title :
A hierarchical projection pursuit clustering algorithm
Author :
Miasnikov, Alexei D. ; Rome, Jayson E. ; Haralick, Robert M.
Author_Institution :
Dept. of Comput. Sci., City Univ. of New York, NY, USA
Abstract :
We define a cluster to be characterized by regions of high density separated by regions that are sparse. By observing the downward closure property of density, the search for interesting structure in a high dimensional space can be reduced to a search for structure in lower dimensional subspaces. We present a hierarchical projection pursuit clustering (HPPC) algorithm that repeatedly bi-partitions the dataset based on the discovered properties of interesting 1-dimensional projections. We describe a projection search procedure and a projection pursuit index function based on Cho, Haralick and Yi´s improvement of the Kittler and Illingworth optimal threshold technique. The output of the algorithm is a decision tree whose nodes store a projection and threshold and whose leaves represent the clusters (classes). Experiments with various real and synthetic datasets show the effectiveness of the approach.
Keywords :
decision trees; pattern clustering; statistical analysis; 1D projections; decision tree; hierarchical projection pursuit clustering algorithm; high dimensional space; lower dimensional subspaces; optimal threshold technique; projection pursuit index function; projection search procedure; Binary trees; Classification tree analysis; Clustering algorithms; Computer science; Decision trees; Indexing; Partitioning algorithms; Pattern recognition; Pursuit algorithms; Shape;
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334104