DocumentCode :
3228026
Title :
A New Supervised Clustering Algorithm for Data Set with Mixed Attributes
Author :
Li, Shijin ; Liu, Jing ; Zhu, Yuelong ; Zhang, Xiaohua
Author_Institution :
Hohai Univ., Nanjing
Volume :
2
fYear :
2007
fDate :
July 30 2007-Aug. 1 2007
Firstpage :
844
Lastpage :
849
Abstract :
Because of the complexity of data set with mixed attributes, the traditional clustering algorithms appropriate for this kind of dataset are few and the effect of clustering is not good. K-prototype clustering is one of the most commonly used methods in data mining for this kind of data. We borrow the ideas from the multiple classifiers combing technology, use k- prototype as the basis clustering algorithm to design a multi-level clustering ensemble algorithm in this paper, which adoptively selects attributes for re-clustering. Comparison experiments on Adult data set from UCI machine learning data repository show very competitive results and the proposed method is suitable for data editing.
Keywords :
data mining; learning (artificial intelligence); pattern classification; pattern clustering; text editing; K-prototype clustering; UCI machine learning data repository; clustering algorithms; data editing; data mining; multilevel clustering ensemble algorithm; multiple classifiers combing technology; supervised clustering algorithm; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data mining; Distributed computing; Machine learning; Machine learning algorithms; Partitioning algorithms; Software algorithms; Software engineering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
Type :
conf
DOI :
10.1109/SNPD.2007.360
Filename :
4287799
Link To Document :
بازگشت