• DocumentCode
    43936
  • Title

    A Structure-Based Distance Metric for High-Dimensional Space Exploration with Multidimensional Scaling

  • Author

    Lee, J.H. ; McDonnell, Kevin T. ; Zelenyuk, Alla ; Imre, Dan ; Mueller, Klaus

  • Author_Institution
    Dept. of Comput. Sci., Stony Brook Univ., Stony Brook, NY, USA
  • Volume
    20
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    351
  • Lastpage
    364
  • Abstract
    Although the euclidean distance does well in measuring data distances within high-dimensional clusters, it does poorly when it comes to gauging intercluster distances. This significantly impacts the quality of global, low-dimensional space embedding procedures such as the popular multidimensional scaling (MDS) where one can often observe nonintuitive layouts. We were inspired by the perceptual processes evoked in the method of parallel coordinates which enables users to visually aggregate the data by the patterns the polylines exhibit across the dimension axes. We call the path of such a polyline its structure and suggest a metric that captures this structure directly in high-dimensional space. This allows us to better gauge the distances of spatially distant data constellations and so achieve data aggregations in MDS plots that are more cognizant of existing high-dimensional structure similarities. Our biscale framework distinguishes far-distances from near-distances. The coarser scale uses the structural similarity metric to separate data aggregates obtained by prior classification or clustering, while the finer scale employs the appropriate euclidean distance.
  • Keywords
    computational geometry; data visualisation; embedded systems; high-dimensional space exploration; low-dimensional space embedding procedures; multidimensional scaling; polylines pattern; spatially distant data constellations; structure-based distance metric; Correlation; Data visualization; Euclidean distance; Extraterrestrial measurements; Indexes; Layout; Information visualization; clustering; high-dimensional data; multivariate visualization; visual analytics;
  • fLanguage
    English
  • Journal_Title
    Visualization and Computer Graphics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1077-2626
  • Type

    jour

  • DOI
    10.1109/TVCG.2013.101
  • Filename
    6560006