• DocumentCode
    2249888
  • Title

    A graph-based algorithm for frequent closed itemsets mining

  • Author

    Li, Li ; Zhai, Donghai ; Jin, Fan

  • Author_Institution
    Sch. of Comput. & Commun. Eng., Southwest Jiaotong Univ., Chengdu, China
  • fYear
    2003
  • fDate
    24-25 April 2003
  • Firstpage
    19
  • Lastpage
    24
  • Abstract
    Frequent itemsets mining plays an essential role in data mining, but it often generates a large number of redundant itemsets that reduce the efficiency of the mining task. Frequent closed itemsets are subset of frequent itemsets, but they contain all information of frequent itemsets. The most existing methods of frequent closed itemset mining are apriori-based. The efficiency of those methods is limited to the repeated database scan and the candidate set generation. We propose a graph-based algorithm for mining frequent closed itemsets called GFCG (graph-based frequent closed itemset generation). The new algorithm constructs an association graph to represent the frequent relationship between items, and recursively generates frequent closed itemset based on that graph. It scans the database for only two times, and avoids candidate set generation. GFCG outperforms a priori-based algorithm in experiment study and shows good performance both in speed and scale up properties.
  • Keywords
    data mining; graph theory; set theory; GFCG; apriori-based algorithm; association graph; candidate set generation; data mining; graph-based algorithm; graph-based frequent closed itemset generation; repeated database scan; Association rules; Data engineering; Data mining; Itemsets; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems and Information Engineering Design Symposium, 2003 IEEE
  • Print_ISBN
    0-9744559-0-3
  • Type

    conf

  • DOI
    10.1109/SIEDS.2003.157999
  • Filename
    1242394