DocumentCode :
3471574
Title :
Association Rules Mining Based on Statistical Correlation
Author :
Jian Hu ; Xiang Yang-Li
Author_Institution :
Sch. of Manage., Harbin Inst. of Technol., Harbin
fYear :
2008
fDate :
12-14 Oct. 2008
Firstpage :
1
Lastpage :
4
Abstract :
Association rules mining is an important data mining task, this paper emphatically analyzes realization skill and defects of existing algorithms. On the basis, a novel measure, named statistical correlation, which can indicate the correlation degree of multi-items in a rule, is put forward to cut association rules with independent or negative correlation, and its concept, calculating formulas and primary characteristics are defined. In order to conveniently use statistical correlation to cut redundancy rules, a fast association rules mining algorithm, called F-Fminer, is designed with two new data structures UFP-Tree and FP-Forest which use multi-trees structure to store data. F-Fminer adopts divide and conquer strategy to mine frequent itemsets for every UFP-Tree basing on depth-first searching. It can be seen from experimentation that the method in this paper has greatly enhanced mining efficiency and reduced a lot of redundant rules than other classical algorithms.
Keywords :
data mining; statistical analysis; tree data structures; F-Fminer; FP-Forest; UFP-Tree; association rules mining; data mining task; data structure; depth-first searching; divide-and-conquer strategy; statistical correlation; Algorithm design and analysis; Association rules; Costs; Data mining; Data structures; Itemsets; Partitioning algorithms; Technology management; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-2107-7
Electronic_ISBN :
978-1-4244-2108-4
Type :
conf
DOI :
10.1109/WiCom.2008.2571
Filename :
4680760
Link To Document :
بازگشت