• DocumentCode
    3228026
  • Title

    A New Supervised Clustering Algorithm for Data Set with Mixed Attributes

  • Author

    Li, Shijin ; Liu, Jing ; Zhu, Yuelong ; Zhang, Xiaohua

  • Author_Institution
    Hohai Univ., Nanjing
  • Volume
    2
  • fYear
    2007
  • fDate
    July 30 2007-Aug. 1 2007
  • Firstpage
    844
  • Lastpage
    849
  • Abstract
    Because of the complexity of data set with mixed attributes, the traditional clustering algorithms appropriate for this kind of dataset are few and the effect of clustering is not good. K-prototype clustering is one of the most commonly used methods in data mining for this kind of data. We borrow the ideas from the multiple classifiers combing technology, use k- prototype as the basis clustering algorithm to design a multi-level clustering ensemble algorithm in this paper, which adoptively selects attributes for re-clustering. Comparison experiments on Adult data set from UCI machine learning data repository show very competitive results and the proposed method is suitable for data editing.
  • Keywords
    data mining; learning (artificial intelligence); pattern classification; pattern clustering; text editing; K-prototype clustering; UCI machine learning data repository; clustering algorithms; data editing; data mining; multilevel clustering ensemble algorithm; multiple classifiers combing technology; supervised clustering algorithm; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data mining; Distributed computing; Machine learning; Machine learning algorithms; Partitioning algorithms; Software algorithms; Software engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-0-7695-2909-7
  • Type

    conf

  • DOI
    10.1109/SNPD.2007.360
  • Filename
    4287799