• DocumentCode
    1504617
  • Title

    An Application of Multivariate Statistical Analysis for Query-Driven Visualization

  • Author

    Gosink, Luke J. ; Garth, Christoph ; Anderson, John C. ; Bethel, E. Wes ; Joy, Kenneth I.

  • Author_Institution
    Pacific Northwest Nat. Lab., Battelle Memorial Inst., Richland, WA, USA
  • Volume
    17
  • Issue
    3
  • fYear
    2011
  • fDate
    3/1/2011 12:00:00 AM
  • Firstpage
    264
  • Lastpage
    275
  • Abstract
    Driven by the ability to generate ever-larger, increasingly complex data, there is an urgent need in the scientific community for scalable analysis methods that can rapidly identify salient trends in scientific data. Query-Driven Visualization (QDV) strategies are among the small subset of techniques that can address both large and highly complex data sets. This paper extends the utility of QDV strategies with a statistics-based framework that integrates nonparametric distribution estimation techniques with a new segmentation strategy to visually identify statistically significant trends and features within the solution space of a query. In this framework, query distribution estimates help users to interactively explore their query´s solution and visually identify the regions where the combined behavior of constrained variables is most important, statistically, to their inquiry. Our new segmentation strategy extends the distribution estimation analysis by visually conveying the individual importance of each variable to these regions of high statistical significance. We demonstrate the analysis benefits these two strategies provide and show how they maybe used to facilitate the refinement of constraints over variables expressed in a user´s query. We apply our method to data sets from two different scientific domains to demonstrate its broad applicability.
  • Keywords
    data visualisation; query processing; statistical analysis; QDV strategies; multivariate statistical analysis; new segmentation strategy; nonparametric distribution estimation techniques; query distribution; query-driven visualization; scientific data; Acceleration; Bandwidth; Data analysis; Data visualization; Information analysis; Kernel; Large-scale systems; Multidimensional systems; Statistical analysis; Visual databases; Query-driven visualization; kernel density estimation.; multivariate analysis; Computer Graphics; Data Interpretation, Statistical; Databases, Factual; Information Storage and Retrieval; Multivariate Analysis; User-Computer Interface;
  • fLanguage
    English
  • Journal_Title
    Visualization and Computer Graphics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1077-2626
  • Type

    jour

  • DOI
    10.1109/TVCG.2010.80
  • Filename
    5473228