Title :
Visualization techniques for mining large databases: a comparison
Author :
Keim, Daniel A. ; Kriegel, Hans-Peter
Author_Institution :
Inst. for Comput. Sci., Munchen Univ., Germany
fDate :
12/1/1996 12:00:00 AM
Abstract :
Visual data mining techniques have proven to be of high value in exploratory data analysis, and they also have a high potential for mining large databases. In this article, we describe and evaluate a new visualization-based approach to mining large databases. The basic idea of our visual data mining techniques is to represent as many data items as possible on the screen at the same time by mapping each data value to a pixel of the screen and arranging the pixels adequately. The major goal of this article is to evaluate our visual data mining techniques and to compare them to other well-known visualization techniques for multidimensional data: the parallel coordinate and stick-figure visualization techniques. For the evaluation of visual data mining techniques, the perception of data properties counts most, while the CPU time and the number of secondary storage accesses are only of secondary importance. In addition to testing the visualization techniques using real data, we developed a testing environment for database visualizations similar to the benchmark approach used for comparing the performance of database systems. The testing environment allows the generation of test data sets with predefined data characteristics which are important for comparing the perceptual abilities of visual data mining techniques
Keywords :
data analysis; data visualisation; deductive databases; knowledge acquisition; program testing; very large databases; CPU time; data items representation; data properties perception; data value-pixel mapping; exploratory data analysis; large databases; multidimensional data; multidimensional multivariate data; parallel coordinate technique; perceptual abilities; predefined data characteristics; secondary storage accesses; stick-figure technique; test data set generation; testing environment; visual data mining techniques; visualization techniques; Benchmark testing; Central Processing Unit; Computer Society; Data analysis; Data mining; Data visualization; Database systems; Multidimensional systems; System testing; Visual databases;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on