• DocumentCode
    2872805
  • Title

    A Novel Clustering Method Combining Heuristics and Information Theorem

  • Author

    Zhao, Zeng-shun ; Hou, Zeng-Guang ; Tan, Min

  • Author_Institution
    Coll. of Inf. & Electr. Eng., Shandong Univ. of Sci. & Technol., Qingdao, China
  • Volume
    2
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    339
  • Lastpage
    343
  • Abstract
    Many data mining tasks require the unsupervised partitioning of a data set into clusters. However, in many case we do not really know any prior knowledge about the clusters, for example, the density or the shape. This paper addresses two major issues associated with conventional competitive learning, namely, sensitivity to initialization and difficulty in determining the number of clusters. Many methods exist for such clustering, but most of then have assumed hyper-ellipsoidal clusters. Many heuristically proposed competitive learning methods and its variants, are somewhat ad hoc without any theoretical support. Under above considerations, we propose an algorithm named as Entropy guided Splitting Competitive Learning (ESCL) in the information theorem framework. Simulations show that minimization of partition entropy can be used to guide the competitive learning process, so to estimate the number and structure of probable data generators.
  • Keywords
    data mining; entropy; pattern clustering; unsupervised learning; clustering method; data mining; entropy guided splitting competitive learning; heuristics; hyper-ellipsoidal clusters; information theorem; initialization sensitivity; partition entropy; unsupervised data set partitioning; Clustering algorithms; Clustering methods; Data mining; Educational institutions; Entropy; Intelligent systems; Laboratories; Partitioning algorithms; Prototypes; Shape; Clustering; Competitive Learning; Entropy; Information Theorem;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.571
  • Filename
    5366770