DocumentCode :
2476978
Title :
Transfer clustering via constraints generated from topics
Author :
Yu, Litao ; Dang, Yanzhong ; Yang, Guangfei
Author_Institution :
Inst. of Syst. Eng., Dalian Univ. of Technol., Dalian, China
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
3203
Lastpage :
3208
Abstract :
Clustering technique is widely used in data mining like gene-microarray analysis and natural language processing. When there are sufficient data samples and good representations, traditional clustering algorithms such as K-means can work well. But when the number of samples is small and the data representation is bad, direct use of clustering may yield bad results. In this paper we propose a new algorithm TCTC(Topic-Constraint Transfer Clustering), which is an instance of unsupervised transfer learning, to cluster a small number of unlabeled data with the help of sufficient and better represented auxiliary data. First several latent topics are extracted from the clusters of the auxiliary data. Then the affinities between target data samples and topics are discovered to “guide” the disseminated data clustering. Finally semi-supervised clustering algorithm is applied on target data. The experiments demonstrate our method is quite effective to solve the problem of disseminated and ill-presented data clustering.
Keywords :
data mining; data structures; lab-on-a-chip; learning (artificial intelligence); natural language processing; pattern clustering; TCTC; data clustering; data mining; gene-microarray analysis; natural language processing; semisupervised clustering algorithm; topic generated constraints; topic-constraint transfer clustering; transfer clustering technique; unlabeled data; unsupervised transfer learning; Algorithm design and analysis; Bridges; Clustering algorithms; Data mining; Entropy; Equations; Mathematical model; Unsupervised transfer learning; semi-supervised clustering; topic transfer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
Type :
conf
DOI :
10.1109/ICSMC.2012.6378284
Filename :
6378284
Link To Document :
بازگشت