• DocumentCode
    3106242
  • Title

    AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery

  • Author

    Hong Cheng ; Yu, Philip S. ; Jiawei Han

  • Author_Institution
    Univ. of Illinois at Urbana-Champaign, Urbana, IL
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    839
  • Lastpage
    844
  • Abstract
    Recent studies have proposed methods to discover approximate frequent itemsets in the presence of random noise. By relaxing the rigid requirement of exact frequent pattern mining, some interesting patterns, which would previously be fragmented by exact pattern mining methods due to the random noise or measurement error, are successfully recovered. Unfortunately, a large number of "uninteresting" candidates are explored as well during the mining process, as a result of the relaxed pattern mining methodology. This severely slows down the mining process. Even worse, it is hard for an end user to distinguish the recovered interesting patterns from these uninteresting ones. In this paper, we propose an efficient algorithm AC-Close to recover the approximate closed itemsets from "core patterns". By focusing on the so-called core patterns, integrated with a top-down mining and several effective pruning strategies, the algorithm narrows down the search space to those potentially interesting ones. Experimental results show that AC-Close substantially outperforms the previously proposed method in terms of efficiency, while delivers a similar set of interesting recovered patterns.
  • Keywords
    data mining; search problems; AC-Close algorithm; approximate closed itemset mining; core pattern recovery; frequent pattern mining; interesting patterns; measurement error; pruning strategies; random noise; search space; top-down mining; transaction databases; uninteresting candidates; Algorithm design and analysis; Data mining; Itemsets; Measurement errors; Transaction databases; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.10
  • Filename
    4053113