Title :
A rough set based clustering algorithm and the information theoretical approach to refine clusters
Author :
Wang, Qingdong ; Dai, Huaping ; Sun, Youxian
Author_Institution :
Nat. Key Lab. of Ind. Control Technol., Zhejiang Univ., Hangzhou, China
Abstract :
In many clustering tasks, adding more information does not produce a corresponding improvement in clustering performance, and irrelevant information reduces the effectiveness of the clustering algorithm. To improve clustering quality, we propose an attribute-weighted clustering algorithm based on rough set theory, combined with an information-theoretic refinement process. First, we give every attribute the same weight and use a rough set based clustering algorithm to obtain the initial classes. Then we weigh every attribute using Shannon's entropy theory, substitute the mutual entropy values for the attribute weights, and run the attribute-weighted rough set clustering algorithm again to refine the clustering result. We tested the algorithm on data sets from the UCI repository. The experimental results show that it achieves a better classification rate and class purity than traditional clustering methods.
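The abstract gives no formulas, so the following is only a minimal sketch of the reweighting step under stated assumptions: attributes are categorical, "mutual entropy" is read as the mutual information between an attribute and the initial class labels, and the weights are normalised to sum to 1. The function names are hypothetical, and the rough set based clustering step itself is not shown.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def mutual_information(xs, ys):
    """I(X; Y) = H(X) + H(Y) - H(X, Y) for two aligned discrete sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def attribute_weights(rows, labels):
    """Weight each attribute by its mutual information with the labels.

    Assumption: weights are normalised to sum to 1, with a fallback to
    equal weights when no attribute carries any information.
    """
    n_attrs = len(rows[0])
    mi = [
        max(0.0, mutual_information([r[j] for r in rows], labels))
        for j in range(n_attrs)
    ]
    total = sum(mi)
    if total == 0:
        return [1.0 / n_attrs] * n_attrs
    return [m / total for m in mi]


def weighted_mismatch(a, b, weights):
    """Sum of the weights of the attributes on which two objects differ."""
    return sum(w for x, y, w in zip(a, b, weights) if x != y)


# Toy data: attribute 0 determines the initial class, attribute 1 is noise.
rows = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
labels = [0, 0, 1, 1]
weights = attribute_weights(rows, labels)
```

In this toy data the second attribute is independent of the initial classes, so it receives no weight and stops influencing the weighted distance used in a second clustering pass.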
Keywords :
entropy; pattern clustering; rough set theory; statistical analysis; Shannon entropy theory; UCI repository; attribute weighted clustering algorithm; clustering processes; information theoretical refinement process; irrelevant information; Clustering algorithms; Clustering methods; Entropy; Industrial control; Laboratories; Refining; Set theory; Sun; Testing
Conference_Title :
Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on
Print_ISBN :
0-7803-8273-0
DOI :
10.1109/WCICA.2004.1342320