• DocumentCode
    1553394
  • Title

    Mining associations with the collective strength approach

  • Author

    Aggarwal, Charu C. ; Yu, Philip S.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
  • Volume
    13
  • Issue
    6
  • fYear
    2001
  • Firstpage
    863
  • Lastpage
    873
  • Abstract
    The large itemset model has been proposed in the literature for finding associations in a large database of sales transactions. A different method for evaluating and finding itemsets referred to as strongly collective itemsets is proposed. We propose a criterion stressing the importance of the actual correlation of the items with one another rather than their absolute level of presence. Previous techniques for finding correlated itemsets are not necessarily applicable to very large databases. We provide an algorithm which provides very good computational efficiency, while maintaining statistical robustness. The fact that this algorithm relies on relative measures rather than absolute measures such as support also implies that the method can be applied to find association rules in data sets in which items may appear in a sizeable percentage of the transactions (dense data sets), data sets in which the items have varying density, or even negative association rules
  • Keywords
    data mining; marketing data processing; very large databases; association mining; collective strength approach; computational efficiency; item correlation; itemset model; large database; sales transactions; statistical robustness; strongly collective itemsets; transactions; Association rules; Computational efficiency; Consumer behavior; Data mining; Density measurement; Itemsets; Marketing and sales; Robustness; Size measurement; Transaction databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.971183
  • Filename
    971183