شماره ركورد
20432
عنوان به زبان ديگر
An Efficient Sampling Approach for Mining all Association Rules in Large Databases
پديد آورندگان
Deypir M نويسنده , Sadreddini M H نويسنده
از صفحه
73
تا صفحه
78
تعداد صفحه
6
چكيده لاتين
Mining association rules is an interesting problem in the field of knowledge discovery and data mining. In this paper a fast, new method for mining association rules is presented. In this new sampling approach, which utilizes FP-Growth to mine sample data, the candidate generation and test in the sample data is omitted because of the FP-Tree projection of sample data in the main memory. To overcome the main memory limitation in the new sampling method, a useful technique is proposed. Theoretical time consideration and empirical evaluation show that the new sampling approach is superior to the traditional Apriori-based sampling by orders of magnitude. Experimental evaluations on artificial and real-life datasets show that our approach, compared with a previously proposed sampling algorithm, is more efficient when the minimal support threshold is decreased and is also more stable as the size of the sample data increases.
شماره مدرك
1204469
لينک به اين مدرک