• DocumentCode
    1636943
  • Title

    An Efficient Approximate Approach to Mining Frequent Itemsets over High Speed Transactional Data Streams

  • Author

    Kuen-Fang Jea ; Chao-Wei Li ; Tsui-Ping Chang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung
  • Volume
    3
  • fYear
    2008
  • Firstpage
    275
  • Lastpage
    280
  • Abstract
    A data stream is a massive and unbounded sequence of data elements that are continuously generated at a fast speed. Compared with traditional data mining, knowledge discovery in data streams is more challenging since several requirements need to be satisfied. In this paper we propose a mining algorithm for finding frequent itemsets over a transactional data stream. Unlike most of existing algorithms, our method works based on the theory of Approximate Inclusion-Exclusion to approximate the itemsets´ counts. Some techniques are designed and integrated into the algorithm for performance improvement. And the performance of the proposed algorithm is tested and analyzed through several experiments.
  • Keywords
    data mining; approximate inclusion-exclusion; data mining; frequent itemset mining algorithm; knowledge discovery; transactional data streams; Algorithm design and analysis; Application software; Chaos; Computer science; Data engineering; Data mining; Data structures; Design engineering; Intelligent systems; Itemsets; approximate approach; combinatorial approximation; data stream; frequent itemsets mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2008. ISDA '08. Eighth International Conference on
  • Conference_Location
    Kaohsiung
  • Print_ISBN
    978-0-7695-3382-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2008.74
  • Filename
    4696474