DocumentCode :
2339310
Title :
Exploring high-D spaces with multiform matrices and small multiples
Author :
MacEachren, Alan ; Xiping, D. ; Hardisty, Frank ; Guo, Diansheng ; Lengerich, Gene
Author_Institution :
Dept. of Geogr., Pennsylvania State Univ., University Park, VA, USA
fYear :
2003
fDate :
21-21 Oct. 2003
Firstpage :
31
Lastpage :
38
Abstract :
We introduce an approach to visual analysis of multivariate data that integrates several methods from information visualization, exploratory data analysis (EDA), and geovisualization. The approach leverages the component-based architecture implemented in GeoVISTA Studio to construct a flexible, multiview, tightly (but generically) coordinated, EDA toolkit. This toolkit builds upon traditional ideas behind both small multiples and scatterplot matrices in three fundamental ways. First, we develop a general, multiform, bivariate matrix and a complementary multiform, bivariate small multiple plot in which different bivariate representation forms can be used in combination. We demonstrate the flexibility of this approach with matrices and small multiples that depict multivariate data through combinations of: scatterplots, bivariate maps, and space-filling displays. Second, we apply a measure of conditional entropy to (a) identify variables from a high-dimensional data set that are likely to display interesting relationships and (b) generate a default order of these variables in the matrix or small multiple display. Third, we add conditioning, a kind of dynamic query/filtering in which supplementary (undisplayed) variables are used to constrain the view onto variables that are displayed. Conditioning allows the effects of one or more well understood variables to be removed form the analysis, making relationships among remaining variables easier to explore. We illustrate the individual and combined functionality enabled by this approach through application to analysis of cancer diagnosis and mortality data and their associated covariates and risk factors.
Keywords :
data analysis; data mining; data visualisation; geographic information systems; medical diagnostic computing; object-oriented programming; pattern recognition; visual databases; EDA; GeoVISTA Studio; bivariate maps; bivariate matrix; bivariate representation; bivarite small multiple plot; cancer diagnosis; component-based architecture; conditional entropy; default order; exploratory data analysis; geovisualization; high-dimensional data set; high-dimensional space exploration; information visualization; mortality data; multiform matrices; multiple display; multivariate data; risk factors; scatterplot matrices; scatterplot matrix; scatterplots; small multiples; space-filling displays; visual analysis; Cancer; Data analysis; Data visualization; Displays; Electronic design automation and methodology; Entropy; Filtering; Information analysis; Scattering; Space exploration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Visualization, 2003. INFOVIS 2003. IEEE Symposium on
Conference_Location :
Seattle, WA, USA
Print_ISBN :
0-7803-8154-8
Type :
conf
DOI :
10.1109/INFVIS.2003.1249006
Filename :
1249006
Link To Document :
بازگشت