Title :
Online generation of association rules
Author :
Aggarwal, Charu C. ; Yu, Philip S.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We have a large database consisting of sales transactions. We investigate the problem of online mining of association rules in this large database. We show how to preprocess the data effectively in order to make it suitable for repeated online queries. The preprocessing algorithm takes into account the storage space available. We store the preprocessed data in such a way that online processing may be done by applying a graph theoretic search algorithm whose complexity is proportional to the size of the output. This results in an online algorithm which is practically instantaneous in terms of response time. The algorithm also supports techniques for quickly discovering association rules from large item sets. The algorithm is capable of finding rules with specific items in the antecedent or consequent. These association rules are presented in a compact form, eliminating redundancy. We believe that the elimination of redundancy in online generation of association rules from large item sets is interesting in its own right
Keywords :
database theory; deductive databases; graph theory; knowledge acquisition; marketing data processing; query processing; search problems; very large databases; association rule online generation; complexity; data mining; data preprocessing; graph theoretic search algorithm; large database; online algorithm; online queries; redundancy; response time; sales transactions; storage space; Association rules; Data mining; Delay; Itemsets; Marketing and sales; Transaction databases;
Conference_Titel :
Data Engineering, 1998. Proceedings., 14th International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-8289-2
DOI :
10.1109/ICDE.1998.655803