Title :
Semi-supervised cluster ensemble based on binary similarity matrix
Author :
Wang, Hongjun ; Qi, Jianhuai ; Zheng, Weifan ; Wang, Mingwen
Author_Institution :
Inf. Res. Inst., SouthWest Jiaotong Univ., Chengdu, China
Abstract :
The paper introduces a semi-supervised cluster ensemble of pairwised constrains based on the binary similarity matrix. Pairwised constrains are the typical way of semi-supervised learning. Cluster ensemble can increase robustness of clustering and it is helpful for knowledge reuse and distributed computing. The existing algorithms are mostly unsupervised algorithms of cluster ensemble which can´t take advantages of known information ofdatasets. As a result the precision, robustness and stability of cluster ensemble are degraded. Semi-supervised cluster ensemble may conquer these disadvantages. The idea is that we use pairwised constrains as semi-supervised learning for semi-supervised cluster ensemble, in this paper there are three works presented. First, we state a semi-supervised cluster ensemble method. Second, the model of semi-supervised cluster ensemble is illustrated in detail. Third, some UCI datasets are chosen for the experiments, and the results show that semi-supervised cluster ensemble works well.
Keywords :
learning (artificial intelligence); matrix algebra; pattern clustering; binary similarity matrix; distributed computing; knowledge reuse; pairwised constraints; semi-supervised cluster ensemble; semi-supervised learning; Clustering algorithms; Data privacy; Distributed computing; Machine learning; Machine learning algorithms; Robust stability; Robustness; Semisupervised learning; Training data; Unsupervised learning; clustering; semi-supervised cluster ensemble;
Conference_Titel :
Information Management and Engineering (ICIME), 2010 The 2nd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5263-7
Electronic_ISBN :
978-1-4244-5265-1
DOI :
10.1109/ICIME.2010.5478054