Title :
Validating clusters using the Hopkins statistic
Author :
Banerjee, Amit ; Davé, Rajesh N.
Author_Institution :
Dept. of Mechanical Eng., New Jersey Inst. of Technol., Newark, NJ, USA
Abstract :
A novel scheme for cluster validity using a test for random position hypothesis is proposed. The random position hypothesis is tested against an alternative clustered hypothesis on every cluster produced by a partitioning algorithm. A test statistic such as the well-known Hopkins statistic could be used as a basis to accept or reject the random position hypothesis, which is also the hypothesis in this case. The Hopkins statistic is known to be a fair estimator of randomness in a data set. The concept is borrowed from the clustering tendency domain and its applicability to validating clusters is shown here using two artificially constructed test data sets.
Keywords :
pattern clustering; random processes; set theory; Hopkins statistic; cluster validity; fair estimator; hypothesis; partitioning algorithm; random position hypothesis; Clustering algorithms; Labeling; Mechanical engineering; Mechanical variables measurement; Partitioning algorithms; Statistical analysis; Statistics; Testing;
Conference_Titel :
Fuzzy Systems, 2004. Proceedings. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8353-2
DOI :
10.1109/FUZZY.2004.1375706