Title :
Discovering representative episodal association rules from event sequences using frequent closed episode sets and event constraints
Author :
Harms, Sherri K. ; Deogun, Jitender ; Saquer, Jamil ; Tadesse, Tsegaye
Author_Institution :
Dept. of Comput. Sci. & Comput. Eng., Nebraska Univ., Lincoln, NE, USA
Abstract :
Discovering association rules from time-series data is an important data mining problem. The number of potential rules grows quickly as the number of items in the antecedent grows. It is therefore difficult for an expert to analyze the rules and identify the useful. An approach for generating representative association rules for transactions that uses only a subset of the set of frequent itemsets called frequent closed itemsets was presented by Saquer and Deogun (2000). We employ formal concept analysis to develop the notion of frequent closed episodes. The concept of representative association rules is formalized in the context of event sequences. Applying constraints to target highly, significant rules further reduces the number of rules. Our approach results in a significant reduction of the number of rules generated, while maintaining the minimum set of relevant association rules and retaining the ability to generate the entire set of association rules with respect to the given constraints. We show how our method can be used to discover associations in a drought risk management decision support system and use multiple climatology datasets related to automated weather stations
Keywords :
climatology; data mining; decision support systems; geophysics computing; hydrology; risk management; time series; automated weather stations; data mining; drought risk management decision support system; event constraints; event sequences; formal concept analysis; frequent closed episode sets; frequent itemsets; multiple climatology datasets; representative episodal association rule discovery; time series data; transactions; Association rules; Clustering algorithms; Data mining; Decision support systems; Government; Itemsets; Pattern analysis; Risk management; Time measurement;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989576