Title :
Estimating intrinsic dimension via clustering
Author :
Eriksson, Brian ; Crovella, Mark
Author_Institution :
Dept. of Comput. Sci., Boston Univ., Boston, MA, USA
Abstract :
Estimating the intrinsic dimension of a data set from pairwise distances is a critical issue for a wide range of disciplines, including genomics, finance, and networking. Current estimation techniques are agnostic to structure in the data, failing to exploit properties that can improve efficiency. In this paper, we present a methodology that uses inherent clustering present in data to efficiently and accurately estimate intrinsic dimension. Our experiments show that this approach has greater accuracy and better scalability than prior techniques, even when the data does not conform to an obvious clustering structure.
Keywords :
data analysis; estimation theory; pattern clustering; clustering structure; data analysis problems; estimation techniques; intrinsic dimension; pairwise distances; Bioinformatics; Clustering algorithms; Computational complexity; Couplings; Fractals; Maximum likelihood estimation;
Conference_Titel :
Statistical Signal Processing Workshop (SSP), 2012 IEEE
Conference_Location :
Ann Arbor, MI
Print_ISBN :
978-1-4673-0182-4
Electronic_ISBN :
pending
DOI :
10.1109/SSP.2012.6319815