Title :
MOTC: an aid to multidimensional hypothesis generation
Author :
Balachandran, E. ; Buzydlowski, J. ; Dworman, G. ; Kimbrough, S.O. ; Rosengarten, E. ; Shafer, T. ; Vachula, W.
Author_Institution :
Pennsylvania Univ., Philadelphia, PA, USA
Abstract :
Reports on conceptual development in the areas of database mining and knowledge discovery in databases (KDD). Our efforts have also led to a prototype implementation, called MOTC, for exploring hypothesis spaces in large and complex data sets. Our KDD conceptual development rests on two main principles. First, we use the crosstab representation for working with qualitative data. This is by now standard practice in OLAP (online analytical processing) applications, and we reaffirm it with additional reasons. Second, and innovatively, we use prediction analysis as a measure of goodness for hypotheses. Prediction analysis is an established statistical technique for analysis of associations among qualitative variables. It generalizes and subsumes a large number of other such measures of association, depending upon the specific assumptions the user is willing to make. As such, it provides a very useful framework for exploring the hypothesis space in a KDD context. This paper illustrates these points with an extensive discussion of MOTC.
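As a rough illustration of how prediction analysis can score a hypothesis on a crosstab, the sketch below computes the "del" measure in its common textbook form: one minus the ratio of observed errors to the errors expected if the two variables were independent. It is not taken from MOTC. The function name `del_measure`, the `error_cells` argument, the unit error weights, and the counts are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def del_measure(table: Sequence[Sequence[float]],
                error_cells: Sequence[Tuple[int, int]]) -> float:
    """Score a hypothesis on a two-way crosstab of counts.

    table       -- rows are categories of X, columns are categories of Y
    error_cells -- (row, col) cells that the hypothesis says should be empty
    Assumes every error cell has weight 1 (hypothetical simplification).
    """
    n = sum(sum(row) for row in table)
    row_totals: List[float] = [sum(row) for row in table]
    col_totals: List[float] = [sum(col) for col in zip(*table)]

    # Errors actually observed in the cells the hypothesis rules out.
    observed = sum(table[i][j] for i, j in error_cells)
    # Errors expected in those cells if X and Y were independent.
    expected = sum(row_totals[i] * col_totals[j] / n for i, j in error_cells)

    if expected == 0:
        raise ValueError("hypothesis has no expected errors; del is undefined")
    return 1 - observed / expected


# Made-up counts. Hypothesis: X=0 predicts Y=0 and X=1 predicts Y=1,
# so the off-diagonal cells are the error cells.
counts = [[40, 10],
          [10, 40]]
score = del_measure(counts, error_cells=[(0, 1), (1, 0)])
print(score)
```

For these counts there are 20 observed errors against 50 expected under independence, so del is 0.6: the hypothesis removes 60% of the errors that chance alone would produce. A value near 0 means the hypothesis predicts no better than independence.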
Keywords :
deductive databases; heuristic programming; knowledge acquisition; statistical analysis; very large databases; MOTC; complex data sets; conceptual development; crosstab representation; database mining; databases; hypothesis goodness measure; hypothesis space exploration; knowledge discovery; multidimensional hypothesis generation; online analytical processing; prediction analysis; prototype implementation; qualitative data; qualitative variable associations; statistical technique; Decision support systems; Multidimensional systems; Prototypes; Pulp and paper industry; Regression analysis; Seminars; Software prototyping; Software standards; Space exploration; Technological innovation; Transaction databases
Conference_Title :
System Sciences, 1998., Proceedings of the Thirty-First Hawaii International Conference on
Conference_Location :
Kohala Coast, HI
Print_ISBN :
0-8186-8255-8
DOI :
10.1109/HICSS.1998.654759