Title :
Boosting Unsupervised Additive Clustering Using Cluster-Wise Optimization and Multi-label Learning
Author :
France, Stephen L. ; Abbasi, Ahmed
Author_Institution :
Lubar Sch. of Bus., Univ. of Wisconsin-Milwaukee, Milwaukee, WI, USA
Abstract :
Additive or overlapping clustering is a technique that is used to analyze overlapping cluster structure in data. In this paper, we motivate the overlapping clustering problem using an example of categorizing movies. We describe the ADCLUS and INDCLUS overlapping clustering models as discrete versions of the CANDECOMP/PARAFAC models. We describe the scalability problems inherent in current overlapping clustering approaches. We give a framework and algorithm for scaling up unsupervised overlapping clustering using a combination of a cluster-wise optimization technique and techniques from multi-label learning. Our framework uses a subset of data to find a training solution and then uses multi-label techniques to find labels for the remaining data.
Keywords :
data handling; learning (artificial intelligence); optimisation; pattern clustering; ADCLUS; CANDECOMP models; INDCLUS; PARAFAC models; boosting unsupervised additive clustering; clusterwise optimization; clusterwise optimization technique; data cluster structure; multilabel learning; overlapping clustering; Additives; Algorithm design and analysis; Clustering algorithms; Motion pictures; Optimization; Scalability; Training; Additive; clustering; multi-label; supervised; unsupervised;
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.40