• DocumentCode
    2772439
  • Title

    An Effective Approach to Inverse Frequent Set Mining

  • Author

    Guzzo, Antonella ; Sacca, D. ; Serra, Edoardo

  • Author_Institution
    DEIS, Univ. of Calabria, Rende, Italy
  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    806
  • Lastpage
    811
  • Abstract
    The inverse frequent set mining problem is the problem of computing a database on which a given collection of itemsets must emerge to be frequent. Earlier studies focused on investigating computational and approximability properties of this problem. In this paper, we face it under the pragmatic perspective of defining heuristic solution approaches that are effective and scalable in real scenarios. In particular, a general formulation of the problem is considered where minimum and maximum support constraints can be defined on each itemset, and where no bound is given beforehand on the size of the resulting output database. Within this setting, an algorithm is proposed that always satisfies the maximum support constraints, but which treats minimum support constraints as soft ones that are enforced as long as possible. A thorough experimentation evidences that minimum support constraints are hardly violated in practice, and that such negligible degradation in accuracy (which is unavoidable due to the theoretical intractability of the problem) is compensated by very good scaling performances.
  • Keywords
    data mining; database management systems; database computing; inverse frequent set mining problem; maximum support constraints; minimum support constraints; Computational complexity; Constraint theory; Data mining; Data privacy; Degradation; Frequency conversion; Frequency measurement; Itemsets; Particle measurements; Transaction databases; Complexity; Data Reconstruction; Inverse Frequent Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.123
  • Filename
    5360315