• DocumentCode
    3180732
  • Title

    Analysis on probabilistic and binary datasets through frequent itemset mining

  • Author

    Bhadoria, Robin Singh ; Kumar, Ram ; Dixit, Manish

  • Author_Institution
    Dept. of CSE, IITM, Gwalior, India
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    263
  • Lastpage
    267
  • Abstract
    Association rule mining is the process of discovering relationships among the data items in large database. It is one of the most important problems in the field of data mining. Finding frequent itemsets is one of the most computationally expensive tasks in association rule mining. The classical frequent itemset mining approaches mine the frequent itemsets from the database where presence of an item in a transaction is certain. Frequent itemset mining under uncertain data model is a new area of research. In this case the presence of an item is given by some likelihood measure. In this paper, we have developed a hyper structure based pattern growth method for frequent itemset mining from uncertain data. We have also developed a maximal clique based candidate pruning method for uncertain data. We have implemented and analyzed the performance of the well known algorithms for frequent itemset mining for both binary and uncertain data model. Our empirical results show that in case of dense binary datasets, FP-growth outperforms all other algorithms, whereas in case of sparse data H-mine outperforms other algorithms.
  • Keywords
    data analysis; data mining; database management systems; association rule mining; binary dataset; data mining; database; dataset analysis; frequent itemset mining; maximal clique based candidate pruning method; pattern growth method; probabilistic dataset; Association rules; Computer science; Itemsets; Probabilistic logic; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technologies (WICT), 2011 World Congress on
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4673-0127-5
  • Type

    conf

  • DOI
    10.1109/WICT.2011.6141255
  • Filename
    6141255