Title :
Iterative projected clustering by subspace mining
Author :
Yiu, Man Lung ; Mamoulis, Nikos
Author_Institution :
Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China
Abstract :
Irrelevant attributes add noise to high-dimensional clusters and render traditional clustering techniques inappropriate. Recently, several algorithms that discover projected clusters and their associated subspaces have been proposed. We realize the analogy between mining frequent itemsets and discovering dense projected clusters around random points. Based on this, we propose a technique that improves the efficiency of a projected clustering algorithm (DOC). Our method is an optimized adaptation of the frequent pattern tree growth method used for mining frequent itemsets. We propose several techniques that employ the branch and bound paradigm to efficiently discover the projected clusters. An experimental study with synthetic and real data demonstrates that our technique significantly improves on the accuracy and speed of previous techniques.
Keywords :
Monte Carlo methods; data mining; database management systems; pattern classification; pattern clustering; tree searching; association rule; branch and bound paradigm; database management; frequent itemset mining; iterative projected clustering; pattern classification; projected clustering algorithm; subspace mining; tree growth method; Acoustic noise; Association rules; Clustering algorithms; Clustering methods; Data mining; Image databases; Itemsets; Iterative algorithms; Lungs; Optimization methods;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2005.29