• DocumentCode
    2865229
  • Title

    Approximate inverse frequent itemset mining: privacy, complexity, and approximation

  • Author

    Wang, Yongge ; Wu, Xintao

  • Author_Institution
    The Univ. of North Carolina at Charlotte, NC, USA
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    In order to generate synthetic basket datasets for better benchmark testing, it is important to integrate characteristics from real-life databases into the synthetic basket datasets. The characteristics that could be used for this purpose include the frequent itemsets and association rules. The problem of generating synthetic basket datasets from frequent itemsets is generally referred to as inverse frequent itemset mining. In this paper, we show that the problem of approximate inverse frequent itemset mining is NP-complete. Then we propose and analyze an approximate algorithm for approximate inverse frequent itemset mining, and discuss privacy issues related to the synthetic basket dataset. In particular, we propose an approximate algorithm to determine the privacy leakage in a synthetic basket dataset.
  • Keywords
    approximation theory; computational complexity; data mining; data privacy; NP-complete problem; approximate inverse frequent itemset mining; approximation algorithm; association rules; benchmark testing; computational complexity; data privacy; real-life databases; synthetic basket dataset; Algorithm design and analysis; Association rules; Benchmark testing; Character generation; Data mining; Data privacy; Databases; Frequency; Itemsets; Linear programming; complexity; data mining; inverse frequent itemset mining; privacy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.27
  • Filename
    1565715