DocumentCode :
2005050
Title :
Active sampling for constrained clustering
Author :
Okabe, Masayuki ; Yamada, Shigeru
Author_Institution :
Inf. & Media Center, Toyohashi Univ. of Technol., Toyohashi, Japan
fYear :
2012
fDate :
20-24 Nov. 2012
Firstpage :
399
Lastpage :
402
Abstract :
Constrained Clustering is a framework of improving clustering performance by using supervised information, which is generally a set of constraints about data pairs. Since performance of constrained clustering depends on a set of constraints to use, we need a method to select good constraints that are expected to promote clustering performance. In this paper, we propose such a method, which actively select data pairs to be constrained by using variance of clustering iteration. This method consists of a bagging based cluster ensemble algorithm that integrates a set of clusters produced by a constrained k-means with random ordered data assignment. Experimental results show that our method outperforms clustering with random sampling method.
Keywords :
learning (artificial intelligence); pattern clustering; performance evaluation; random processes; sampling methods; active sampling; bagging-based cluster ensemble algorithm; clustering iteration variance; clustering performance improvement; constrained clustering; constrained k-means; constraint selection method; data pair selection; random ordered data assignment; supervised information;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
Type :
conf
DOI :
10.1109/SCIS-ISIS.2012.6505193
Filename :
6505193
Link To Document :
بازگشت