DocumentCode
3471574
Title
Association Rules Mining Based on Statistical Correlation
Author
Jian Hu ; Xiang Yang-Li
Author_Institution
Sch. of Manage., Harbin Inst. of Technol., Harbin
fYear
2008
fDate
12-14 Oct. 2008
Firstpage
1
Lastpage
4
Abstract
Association rules mining is an important data mining task, this paper emphatically analyzes realization skill and defects of existing algorithms. On the basis, a novel measure, named statistical correlation, which can indicate the correlation degree of multi-items in a rule, is put forward to cut association rules with independent or negative correlation, and its concept, calculating formulas and primary characteristics are defined. In order to conveniently use statistical correlation to cut redundancy rules, a fast association rules mining algorithm, called F-Fminer, is designed with two new data structures UFP-Tree and FP-Forest which use multi-trees structure to store data. F-Fminer adopts divide and conquer strategy to mine frequent itemsets for every UFP-Tree basing on depth-first searching. It can be seen from experimentation that the method in this paper has greatly enhanced mining efficiency and reduced a lot of redundant rules than other classical algorithms.
Keywords
data mining; statistical analysis; tree data structures; F-Fminer; FP-Forest; UFP-Tree; association rules mining; data mining task; data structure; depth-first searching; divide-and-conquer strategy; statistical correlation; Algorithm design and analysis; Association rules; Costs; Data mining; Data structures; Itemsets; Partitioning algorithms; Technology management; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-2107-7
Electronic_ISBN
978-1-4244-2108-4
Type
conf
DOI
10.1109/WiCom.2008.2571
Filename
4680760
Link To Document