DocumentCode
2612281
Title
Attribute-oriented induction using domain generalization graphs
Author
Hamilton, Howard J. ; Hilderman, Robert J. ; Cercone, Nick
Author_Institution
Dept. of Comput. Sci., Regina Univ., Sask., Canada
fYear
1996
fDate
16-19 Nov. 1996
Firstpage
246
Lastpage
253
Abstract
Attribute-oriented induction summarizes the information in a relational database by repeatedly replacing specific attribute values with more general concepts according to user-defined concept hierarchies. We show how domain generalization graphs can be constructed from multiple concept hierarchies associated with an attribute, describe how these graphs can be used to control the generalization of a set of attributes, and present the Multi-Attribute Generalization algorithm for attribute-oriented induction using domain generalization graphs. Based upon a generate-and-test approach, the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the individual attributes, to produce all possible generalized relations for the set of attributes. We rant the interestingness of the resulting generalized relations using measures based upon relative entropy and variance. Our experiments show that these measures provide a basis for analyzing summary data from relational databases. Variance appears more useful because it tends to rank the less complex generalized relations (i.e., those with few attributes and/or few tuples) as more interesting.
Keywords
graph theory; inference mechanisms; knowledge acquisition; relational databases; attribute values; attribute-oriented induction; domain generalization graphs; generate-and-test approach; multiple concept hierarchies; relational database; user-defined concept hierarchies; Classification tree analysis; Computer science; Data analysis; Decision trees; Entropy; Frequency; Global Positioning System; Marketing and sales; Partitioning algorithms; Relational databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 1996., Proceedings Eighth IEEE International Conference on
ISSN
1082-3409
Print_ISBN
0-8186-7686-7
Type
conf
DOI
10.1109/TAI.1996.560458
Filename
560458
Link To Document