• DocumentCode
    1302831
  • Title

    Data signatures and visualization of scientific data sets

  • Author

    Wong, Pak Chung ; Foote, H. ; Leung, Ruby ; Adams, David ; Thomas, Jim

  • Author_Institution
    Pacific Northwest Nat. Lab., USA
  • Volume
    20
  • Issue
    2
  • fYear
    2000
  • Firstpage
    12
  • Lastpage
    15
  • Abstract
    Today, as data sets used in computations grow in size and complexity, the technologies developed over the years to deal with scientific data sets have become less efficient and effective. Many frequently used operations, such as eigenvector computation, could quickly exhaust our desktop workstations once the data size reaches certain limits. On the other hand, the high-dimensional data sets we collect every day don't relieve the problem. Many conventional metric designs that build on quantitative or categorical data sets cannot be applied directly to heterogeneous data sets with multiple data types. While building new machines with more resources might conquer the data size problems, the complexity of today's computations requires a new breed of projection techniques to support analysis of the data and verification of the results. We introduce the concept of a data signature, which captures the essence of a scientific data set in a compact format, and use it to conduct analysis as if using the original. A time-dependent climate simulation data set demonstrates our approach and presents the results.
  • Keywords
    climatology; computational complexity; data structures; data visualisation; geophysics computing; scientific information systems; categorical data sets; data sets; data signature; data signatures; data size; data size problems; desktop workstations; eigenvector computation; heterogeneous data sets; high-dimensional data sets; metric designs; multiple data types; projection techniques; scientific data set; scientific data set visualization; time-dependent climate simulation data set; Buildings; Combustion; Computational modeling; Data visualization; Laboratories; Research and development; Tensile stress; Text analysis; Text recognition; Workstations;
  • fLanguage
    English
  • Journal_Title
    Computer Graphics and Applications, IEEE
  • Publisher
    IEEE
  • ISSN
    0272-1716
  • Type

    jour

  • DOI
    10.1109/38.824451
  • Filename
    824451