• DocumentCode
    3541478
  • Title

    Estimating intrinsic dimension via clustering

  • Author

    Eriksson, Brian ; Crovella, Mark

  • Author_Institution
    Dept. of Comput. Sci., Boston Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    5-8 Aug. 2012
  • Firstpage
    760
  • Lastpage
    763
  • Abstract
    Estimating the intrinsic dimension of a data set from pairwise distances is a critical issue for a wide range of disciplines, including genomics, finance, and networking. Current estimation techniques are agnostic to structure in the data, failing to exploit properties that can improve efficiency. In this paper, we present a methodology that uses inherent clustering present in data to efficiently and accurately estimate intrinsic dimension. Our experiments show that this approach has greater accuracy and better scalability than prior techniques, even when the data does not conform to an obvious clustering structure.
  • Keywords
    data analysis; estimation theory; pattern clustering; clustering structure; data analysis problems; estimation techniques; intrinsic dimension; pairwise distances; Bioinformatics; Clustering algorithms; Computational complexity; Couplings; Fractals; Maximum likelihood estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing Workshop (SSP), 2012 IEEE
  • Conference_Location
    Ann Arbor, MI
  • ISSN
    pending
  • Print_ISBN
    978-1-4673-0182-4
  • Electronic_ISBN
    pending
  • Type

    conf

  • DOI
    10.1109/SSP.2012.6319815
  • Filename
    6319815