Title :
A Novel Clustering Method Combining Heuristics and Information Theorem
Author :
Zhao, Zeng-shun ; Hou, Zeng-Guang ; Tan, Min
Author_Institution :
Coll. of Inf. & Electr. Eng., Shandong Univ. of Sci. & Technol., Qingdao, China
Abstract :
Many data mining tasks require the unsupervised partitioning of a data set into clusters. However, in many cases we have no prior knowledge about the clusters, for example, their density or shape. This paper addresses two major issues associated with conventional competitive learning, namely, sensitivity to initialization and difficulty in determining the number of clusters. Many methods exist for such clustering, but most of them assume hyper-ellipsoidal clusters. Many heuristically proposed competitive learning methods and their variants are somewhat ad hoc and lack theoretical support. With these considerations in mind, we propose an algorithm named Entropy guided Splitting Competitive Learning (ESCL) in an information-theoretic framework. Simulations show that minimization of partition entropy can be used to guide the competitive learning process and thereby to estimate the number and structure of probable data generators.
Keywords :
data mining; entropy; pattern clustering; unsupervised learning; clustering method; entropy guided splitting competitive learning; heuristics; hyper-ellipsoidal clusters; information theorem; initialization sensitivity; partition entropy; unsupervised data set partitioning; Clustering algorithms; Clustering methods; Educational institutions; Intelligent systems; Laboratories; Partitioning algorithms; Prototypes; Shape; Clustering; Competitive Learning
Conference_Title :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
DOI :
10.1109/ICNC.2009.571