• DocumentCode
    2909464
  • Title

    An effective evolutionary algorithm for discrete-valued data clustering

  • Author

    Ma, Patrick C H ; Chan, Keith C C ; Yao, Xin

  • Author_Institution
    Dept. of Comput., Hong Kong Polytech. Univ., Hong Kong
  • fYear
    2008
  • fDate
    1-6 June 2008
  • Firstpage
    210
  • Lastpage
    216
  • Abstract
    Clustering is concerned with the discovery of interesting groupings of records in a database. Of the many algorithms have been developed to tackle clustering problems in a variety of application domains, a lot of effort has been put into the development of effective algorithms for handling spatial data. These algorithms were originally developed to handle continuous-valued attributes, and the distance functions such as the Euclidean distance measure are often used to measure the pair-wise similarity/distance between records so as to determine the cluster memberships of records. Since such distance functions cannot be validly defined in non-Euclidean space, these algorithms therefore cannot be used to handle databases that contain discrete-valued data. Owing to the fact that data in the real-life databases are always described by a set of descriptive attributes, many of which are not numerical or inherently ordered in any way, it is important that a clustering algorithm should be developed to handle data mining tasks involving them. In this paper, we propose an effective evolutionary clustering algorithm for this problem. For performance evaluation, we have tested the proposed algorithm using several real data sets. Experimental results show that it outperforms the existing algorithms commonly used for discrete-valued data clustering, and also, when dealing with mixed continuous- and discrete-valued data, its performance is also promising.
  • Keywords
    data mining; evolutionary computation; geometry; pattern clustering; visual databases; Euclidean distance measure; data mining; discrete-valued data clustering; evolutionary clustering algorithm; real-life databases; spatial data; Evolutionary computation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-1822-0
  • Electronic_ISBN
    978-1-4244-1823-7
  • Type

    conf

  • DOI
    10.1109/CEC.2008.4630801
  • Filename
    4630801