• DocumentCode
    3471574
  • Title

    Association Rules Mining Based on Statistical Correlation

  • Author

    Jian Hu ; Xiang Yang-Li

  • Author_Institution
    Sch. of Manage., Harbin Inst. of Technol., Harbin
  • fYear
    2008
  • fDate
    12-14 Oct. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Association rules mining is an important data mining task, this paper emphatically analyzes realization skill and defects of existing algorithms. On the basis, a novel measure, named statistical correlation, which can indicate the correlation degree of multi-items in a rule, is put forward to cut association rules with independent or negative correlation, and its concept, calculating formulas and primary characteristics are defined. In order to conveniently use statistical correlation to cut redundancy rules, a fast association rules mining algorithm, called F-Fminer, is designed with two new data structures UFP-Tree and FP-Forest which use multi-trees structure to store data. F-Fminer adopts divide and conquer strategy to mine frequent itemsets for every UFP-Tree basing on depth-first searching. It can be seen from experimentation that the method in this paper has greatly enhanced mining efficiency and reduced a lot of redundant rules than other classical algorithms.
  • Keywords
    data mining; statistical analysis; tree data structures; F-Fminer; FP-Forest; UFP-Tree; association rules mining; data mining task; data structure; depth-first searching; divide-and-conquer strategy; statistical correlation; Algorithm design and analysis; Association rules; Costs; Data mining; Data structures; Itemsets; Partitioning algorithms; Technology management; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications, Networking and Mobile Computing, 2008. WiCOM '08. 4th International Conference on
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-4244-2107-7
  • Electronic_ISBN
    978-1-4244-2108-4
  • Type

    conf

  • DOI
    10.1109/WiCom.2008.2571
  • Filename
    4680760