Title :
Efficiency and Consistency Study on Carma
Author :
Huang, Yuan ; Wang, Xing ; Shia, Ben-Chang
Author_Institution :
Sch. of Stat., Renmin Univ. of China, Beijing, China
Abstract :
Carma is a type of online association algorithm, designed to facilitate association rule with online data flow and successively changing support thresholds. In this paper we study the factors that contribute to the efficiency of Carma and how data flow distribution give effects on the performance of Carma. We design several experiments with two kinds of data. In fixed support threshold situations, we compare Carma with that of Apriori. We find the sets generated by Carma are subsets of those generated by Apriori. We find that if the support threshold is reasonably defined, these two algorithms reach the same results. On the other hand, as the support threshold increases, Phase I generates less items and the number of deleted sets from Phase II first increases and then declines. Carma behaves consistently towards changing support. We notice the earlier the items enter into a lattice, the more accurate the estimations are. If base stone elements show up early in the transaction, the performance of Phase II is mainly influenced by the late-entered item sets. Based on the discussion with Carma, we propose a new procedure to improve Carma. Simulations reveal that the modified algorithm works well.
Keywords :
data flow computing; data mining; interactive programming; Carma; association rule; data flow distribution; online association algorithm; online data flow; Algorithm design and analysis; Association rules; Data mining; Information science; Itemsets; Lattices; Partitioning algorithms; Statistical distributions; Statistics; Transaction databases; Apriori; Association rule algorithm; Carma;
Conference_Titel :
INC, IMS and IDC, 2009. NCM '09. Fifth International Joint Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-5209-5
Electronic_ISBN :
978-0-7695-3769-6
DOI :
10.1109/NCM.2009.241