Title :
An Application of Multivariate Statistical Analysis for Query-Driven Visualization
Author :
Gosink, Luke J. ; Garth, Christoph ; Anderson, John C. ; Bethel, E. Wes ; Joy, Kenneth I.
Author_Institution :
Pacific Northwest Nat. Lab., Battelle Memorial Inst., Richland, WA, USA
fDate :
3/1/2011 12:00:00 AM
Abstract :
Driven by the ability to generate ever-larger, increasingly complex data, there is an urgent need in the scientific community for scalable analysis methods that can rapidly identify salient trends in scientific data. Query-Driven Visualization (QDV) strategies are among the small subset of techniques that can address both large and highly complex data sets. This paper extends the utility of QDV strategies with a statistics-based framework that integrates nonparametric distribution estimation techniques with a new segmentation strategy to visually identify statistically significant trends and features within the solution space of a query. In this framework, query distribution estimates help users to interactively explore their query´s solution and visually identify the regions where the combined behavior of constrained variables is most important, statistically, to their inquiry. Our new segmentation strategy extends the distribution estimation analysis by visually conveying the individual importance of each variable to these regions of high statistical significance. We demonstrate the analysis benefits these two strategies provide and show how they maybe used to facilitate the refinement of constraints over variables expressed in a user´s query. We apply our method to data sets from two different scientific domains to demonstrate its broad applicability.
Keywords :
data visualisation; query processing; statistical analysis; QDV strategies; multivariate statistical analysis; new segmentation strategy; nonparametric distribution estimation techniques; query distribution; query-driven visualization; scientific data; Acceleration; Bandwidth; Data analysis; Data visualization; Information analysis; Kernel; Large-scale systems; Multidimensional systems; Statistical analysis; Visual databases; Query-driven visualization; kernel density estimation.; multivariate analysis; Computer Graphics; Data Interpretation, Statistical; Databases, Factual; Information Storage and Retrieval; Multivariate Analysis; User-Computer Interface;
Journal_Title :
Visualization and Computer Graphics, IEEE Transactions on
DOI :
10.1109/TVCG.2010.80