Title :
An Effective Approach to Inverse Frequent Set Mining
Author :
Guzzo, Antonella ; Sacca, D. ; Serra, Edoardo
Author_Institution :
DEIS, Univ. of Calabria, Rende, Italy
Abstract :
The inverse frequent set mining problem is the problem of computing a database on which a given collection of itemsets must emerge to be frequent. Earlier studies focused on investigating computational and approximability properties of this problem. In this paper, we face it under the pragmatic perspective of defining heuristic solution approaches that are effective and scalable in real scenarios. In particular, a general formulation of the problem is considered where minimum and maximum support constraints can be defined on each itemset, and where no bound is given beforehand on the size of the resulting output database. Within this setting, an algorithm is proposed that always satisfies the maximum support constraints, but which treats minimum support constraints as soft ones that are enforced as long as possible. A thorough experimentation evidences that minimum support constraints are hardly violated in practice, and that such negligible degradation in accuracy (which is unavoidable due to the theoretical intractability of the problem) is compensated by very good scaling performances.
Keywords :
data mining; database management systems; database computing; inverse frequent set mining problem; maximum support constraints; minimum support constraints; Computational complexity; Constraint theory; Data mining; Data privacy; Degradation; Frequency conversion; Frequency measurement; Itemsets; Particle measurements; Transaction databases; Complexity; Data Reconstruction; Inverse Frequent Mining;
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2009.123