Title :
Attribute-oriented induction using domain generalization graphs
Author :
Hamilton, Howard J. ; Hilderman, Robert J. ; Cercone, Nick
Author_Institution :
Dept. of Comput. Sci., Regina Univ., Sask., Canada
Abstract :
Attribute-oriented induction summarizes the information in a relational database by repeatedly replacing specific attribute values with more general concepts according to user-defined concept hierarchies. We show how domain generalization graphs can be constructed from multiple concept hierarchies associated with an attribute, describe how these graphs can be used to control the generalization of a set of attributes, and present the Multi-Attribute Generalization algorithm for attribute-oriented induction using domain generalization graphs. Based upon a generate-and-test approach, the algorithm generates all possible combinations of nodes from the domain generalization graphs associated with the individual attributes, to produce all possible generalized relations for the set of attributes. We rant the interestingness of the resulting generalized relations using measures based upon relative entropy and variance. Our experiments show that these measures provide a basis for analyzing summary data from relational databases. Variance appears more useful because it tends to rank the less complex generalized relations (i.e., those with few attributes and/or few tuples) as more interesting.
Keywords :
graph theory; inference mechanisms; knowledge acquisition; relational databases; attribute values; attribute-oriented induction; domain generalization graphs; generate-and-test approach; multiple concept hierarchies; relational database; user-defined concept hierarchies; Classification tree analysis; Computer science; Data analysis; Decision trees; Entropy; Frequency; Global Positioning System; Marketing and sales; Partitioning algorithms; Relational databases;
Conference_Titel :
Tools with Artificial Intelligence, 1996., Proceedings Eighth IEEE International Conference on
Print_ISBN :
0-8186-7686-7
DOI :
10.1109/TAI.1996.560458