DocumentCode
3093437
Title
A framework for statistical data mining with summary tables
Author
Hou, Wen-Chi
Author_Institution
Dept. of Comput. Sci., Southern Illinois Univ., Carbondale, IL, USA
fYear
1999
fDate
36373
Firstpage
14
Lastpage
23
Abstract
Presents a framework for statistical data mining using summary tables. A set of operators is proposed for common data mining tasks, such as summarization, association, classification and clustering, as well as for basic statistical analysis, such as hypothesis testing, estimation and regression, which can help explore knowledge. The operators enable users to explore a variety of knowledge effectively and yet require users to have little statistical knowledge. Summary tables, which store basic information about groups of tuples of the underlying relations, are constructed to speed up the data mining process. The summary tables are incrementally updatable and are able to support a variety of data mining and statistical analysis tasks. The operators, together with the uses of the summary tables, can make interactive data mining flexible, effective, and perhaps instantaneous
Keywords
data mining; statistical databases; association; classification; clustering; data mining operators; estimation; hypothesis testing; incrementally updatable tables; interactive data mining; knowledge exploration; regression; statistical analysis; statistical data mining; summarization; summary tables; tuple groups; Data analysis; Data mining; Information theory; Machine learning; Measurement uncertainty; Psychology; Relational databases; Solids; Statistical analysis; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Scientific and Statistical Database Management, 1999. Eleventh International Conference on
Conference_Location
Cleveland, OH
Print_ISBN
0-7695-0046-3
Type
conf
DOI
10.1109/SSDM.1999.787617
Filename
787617
Link To Document