• DocumentCode
    2544232
  • Title

    Parallel Method for Discovering Frequent Itemsets Using Weighted Tree Approach

  • Author

    Kumar, Preetham ; Ananthanarayana, V.S.

  • Author_Institution
    Dept. of Inf. & Commun. Technol., Manipal Inst. of Technol., Manipal
  • Volume
    1
  • fYear
    2009
  • fDate
    22-24 Jan. 2009
  • Firstpage
    124
  • Lastpage
    128
  • Abstract
    Every element of the transaction in a transaction database may contain the components such as item number, quantity, cost of the item bought and some other relevant information of the customer. Most of the association rules mining algorithms to discover frequent itemsets do not consider the components such as quantity, cost etc. In a large database it is possible that even if the itemset appears in a very few transactions, it may be purchased in a large quantity. Further, this may lead to very high profit. Therefore these components are the most important information and without which it may cause the lose of information. This motivated us to propose a parallel algorithm to discover all frequent itemsets based on the quantity of the item bought in a single scan of the database. This method achieves its efficiency by applying two new ideas. Firstly, transaction database is converted into an abstraction called weighted tree that prevents multiple scanning of the database during the mining phase. This data structure is replicated among the parallel nodes. Secondly, for each frequent item assigned to a parallel node, an item tree is constructed and frequent itemsets are mined from this tree based on weighted minimum support.
  • Keywords
    data mining; parallel algorithms; transaction processing; tree data structures; association rules mining; data structure; frequent itemset discovery; item quantity; parallel algorithm; transaction database; weighted minimum support; weighted tree; Association rules; Communications technology; Concurrent computing; Costs; Data engineering; Data mining; Itemsets; Parallel algorithms; Space technology; Transaction databases; Weight; attribute; component; cost; parallel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Engineering and Technology, 2009. ICCET '09. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-3334-6
  • Type

    conf

  • DOI
    10.1109/ICCET.2009.194
  • Filename
    4769439