DocumentCode
3228026
Title
A New Supervised Clustering Algorithm for Data Set with Mixed Attributes
Author
Li, Shijin ; Liu, Jing ; Zhu, Yuelong ; Zhang, Xiaohua
Author_Institution
Hohai Univ., Nanjing
Volume
2
fYear
2007
fDate
July 30 2007-Aug. 1 2007
Firstpage
844
Lastpage
849
Abstract
Because of the complexity of data set with mixed attributes, the traditional clustering algorithms appropriate for this kind of dataset are few and the effect of clustering is not good. K-prototype clustering is one of the most commonly used methods in data mining for this kind of data. We borrow the ideas from the multiple classifiers combing technology, use k- prototype as the basis clustering algorithm to design a multi-level clustering ensemble algorithm in this paper, which adoptively selects attributes for re-clustering. Comparison experiments on Adult data set from UCI machine learning data repository show very competitive results and the proposed method is suitable for data editing.
Keywords
data mining; learning (artificial intelligence); pattern classification; pattern clustering; text editing; K-prototype clustering; UCI machine learning data repository; clustering algorithms; data editing; data mining; multilevel clustering ensemble algorithm; multiple classifiers combing technology; supervised clustering algorithm; Algorithm design and analysis; Clustering algorithms; Clustering methods; Data mining; Distributed computing; Machine learning; Machine learning algorithms; Partitioning algorithms; Software algorithms; Software engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location
Qingdao
Print_ISBN
978-0-7695-2909-7
Type
conf
DOI
10.1109/SNPD.2007.360
Filename
4287799
Link To Document