Title : 
Subspace clustering via thresholding and spectral clustering
         
        
            Author : 
Heckel, Reinhard ; Bolcskei, Helmut
         
        
            Author_Institution : 
Dept. of IT & EE, ETH Zurich, Zurich, Switzerland
         
        
        
        
        
            Abstract : 
We consider the problem of clustering a set of high-dimensional data points into sets of low-dimensional linear subspaces. The number of subspaces, their dimensions, and their orientations are unknown. We propose a simple and low-complexity clustering algorithm based on thresholding the correlations between the data points followed by spectral clustering. A probabilistic performance analysis shows that this algorithm succeeds even when the subspaces intersect, and when the dimensions of the subspaces scale (up to a log-factor) linearly in the ambient dimension. Moreover, we prove that the algorithm also succeeds for data points that are subject to erasures with the number of erasures scaling (up to a log-factor) linearly in the ambient dimension. Finally, we propose a simple scheme that provably detects outliers.
         
        
            Keywords : 
data handling; pattern clustering; probability; high dimensional data points; low dimensional linear subspaces; low-complexity clustering algorithm; probabilistic performance analysis; spectral clustering; subspace clustering; thresholding clustering; Algorithm design and analysis; Clustering algorithms; Computer vision; Correlation; Heart; Probabilistic logic; Vectors; erasures; outlier detection; principal angles; spectral clustering; subspace clustering;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
         
        
            Conference_Location : 
Vancouver, BC
         
        
        
        
            DOI : 
10.1109/ICASSP.2013.6638261