DocumentCode :
2771527
Title :
Relevant Subspace Clustering: Mining the Most Interesting Non-redundant Concepts in High Dimensional Data
Author :
Muller, E. ; Assent, Ira ; Gunnemann, Stephan ; Krieger, Ralph ; Seidl, Thomas
Author_Institution :
RWTH Aachen Univ., Aachen, Germany
fYear :
2009
fDate :
6-9 Dec. 2009
Firstpage :
377
Lastpage :
386
Abstract :
Subspace clustering aims at detecting clusters in any subspace projection of a high dimensional space. As the number of possible subspace projections is exponential in the number of dimensions, the result is often tremendously large. Recent approaches fail to reduce results to relevant subspace clusters. Their results are typically highly redundant, i.e. many clusters are detected multiple times in several projections. In this work, we propose a novel model for relevant subspace clustering (RESCU). We present a global optimization which detects the most interesting non-redundant subspace clusters. We prove that computation of this model is NP-hard. For RESCU, we propose an approximative solution that shows high accuracy with respect to our relevance model. Thorough experiments on synthetic and real world data show that RESCU successfully reduces the result to manageable sizes. It reliably achieves top clustering quality while competing approaches show greatly varying performance.
Keywords :
data mining; optimisation; pattern clustering; NP-hard; clustering quality; global optimization; high dimensional data; non-redundant subspace clusters; nonredundant concepts; relevance model; relevant subspace clustering; subspace projections; Bioinformatics; Computational modeling; Data analysis; Data mining; Gene expression; Genomics; Object detection; Principal component analysis; Redundancy; Sensor phenomena and characterization; data mining; global optimization; high dimensional data; redundancy removal; subspace clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
ISSN :
1550-4786
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2009.10
Filename :
5360263
Link To Document :
بازگشت