Title of article :
Itemset generalization with cardinality-based constraints
Author/Authors :
Luca Cagliero، نويسنده , , Paolo Garza، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2013
Pages :
14
From page :
161
To page :
174
Abstract :
Generalized itemset mining is an established data mining technique that focuses on discovering high-level correlations among large databases. By exploiting a taxonomy built over the data items, items are aggregated into higher level concepts and, thus, data correlations at different abstraction levels can be discovered. However, since a large number of patterns can be extracted, the result of the mining process is often not easily manageable by domain experts. We propose a novel approach to discovering a compact subset of generalized itemsets from structured data. To guarantee model conciseness and readability, a set of itemsets that has a common generalization is generated only when its cardinality is so small that its manual inspection is practically feasible. Furthermore, generalizations are generated only when their knowledge is covered by a large number of low-level descendant itemsets, and the generalizations are worth considering in place of their many low-level descendants only in these cases. Experiments performed on synthetic, benchmark, and real data taken from a mobile application scenario demonstrate the effectiveness and efficiency of the proposed approach.
Keywords :
Mobile data analysis , DATA MINING , Generalized itemset mining
Journal title :
Information Sciences
Serial Year :
2013
Journal title :
Information Sciences
Record number :
1215743
Link To Document :
بازگشت